Abstract. We study optimal risk sharing among n agents endowed with distortion risk
measures. Our model includes market frictions that can either represent linear transaction 
costs or risk premia charged by a clearing house for the agents. Risk sharing under third- 
party constraints is also considered. We obtain an explicit formula for Pareto optimal 
allocations. In particular, we find that a stop-loss or deductible risk sharing is optimal in 
the case of two agents and several common distortion functions. This extends a recent result
of Jouini et al. (2006) to the problem with unbounded risks and market frictions. 

1. Introduction 

Many financial problems involve transfer of risk among agents. Two noteworthy examples 
are insurance markets and the general equilibrium theory of stock prices. In such problems, 
$n \ge 2$ agents with risky endowments (or loss exposures) $X_i$ for $i = 1, 2, \dots, n$ are interested
in devising an optimal re-allocation of their risks. Let $X = \sum_{i=1}^n X_i$ be the total exposure of
the n agents, and let $V_i$ be the subjective valuation (preference) functional of the i-th agent.
Consider the collection of allocations of the loss X, namely

$$\mathcal{A}(X) := \Big\{ \mathbf{Y} := (Y_1, Y_2, \dots, Y_n) : X = \sum_{i=1}^n Y_i, \ V_i(Y_i) \text{ finite} \Big\}.$$

The risk sharing problem consists in finding an optimal allocation $\mathbf{Y}^* \in \mathcal{A}(X)$, namely an
allocation such that (i) $\mathbf{Y}^*$ is Pareto optimal, that is, no agent can be made strictly better
off without another agent being made strictly worse off; and (ii) $\mathbf{Y}^*$ satisfies a rationality
constraint, that is, all agents are at least as well off under $\mathbf{Y}^*$ as under the initial exposures
$\mathbf{X} = (X_1, X_2, \dots, X_n)$. The latter feasibility constraint is motivated by the assumption that
only an irrational agent would enter into a contract that made the agent (strictly) worse off. 

The key ingredients in the above problem are the preference functionals $V_i$, and accordingly
the optimal risk sharing literature has evolved as new theories of risk have been developed.
Pioneering work was carried out in the 1960s by Borch [7] and Arrow [3] who showed that
deductible insurance is optimal under concave risk preferences, specifically, when the $V_i$ are
represented by von Neumann-Morgenstern utility functions. Later research studied the case 
of the dual theory of risk of Yaari [40] and Choquet expected utility theory [12]. Very 
recently, research has focused on risk preferences given in terms of convex risk measures [18]. 
In particular, Barrieu and El Karoui [4] studied optimal risk sharing under the exponential 
indifference measure, while Jouini et al. [25] analyzed the case of two agents and convex, 
law-invariant risk measures. The related question of market equilibrium was addressed in [2],
[11] and [17]. On a more abstract level, Ludkovski and Rüschendorf [28] show that Pareto
optimal allocations are comonotone if the risk measures preserve the convex order. The latter 
structural result allows for some explicit computations, as it permits direct representation 
of possible allocations through the pooling functions. 

A simultaneous strand of the literature has been addressing extensions of the basic in- 
surance problem that take into account market frictions. For example, the fundamental 
problem of adverse selection was initiated by Rothschild and Stiglitz [34] and later further 
discussed in [40]. The effect of transaction costs on optimal contracts was first considered by
Raviv [30]. Other possible externalities are summarized in the survey articles of Gerber [19]
and Aase [1]. Many markets also impose constraints on possible risk transfers. Often, only a
limited set of risk instruments is a priori given, so that risk sharing must belong to the span
of available contracts (as studied by Filipovic and Kupper [16]). Alternatively, the amount
of risk transfer is limited by regulatory authorities; for instance in the classical insurance
problem the insurer may be able to take on only part of the total risk due to risk capital
regulations. The latter problem, which we call risk sharing under constraints, introduces
effectively n + 1 players into the model, namely n original participants, plus the additional
regulator that imposes limits on allowable risk exposures of each participant. The special
case of Value-at-Risk constraints was recently analyzed in Bernard and Tian [6]. 

This article extends previous results in these two directions by studying optimal risk 
sharing in the context of distortion risk measures, transaction costs and/or third-party con- 
straints. Distortion risk measures lie at the junction of actuarial and financial applications, 
being related both to the dual theory of risk and coherent risk measures. The transaction 
costs in our model have a dual nature and can either represent genuine transaction fees aris- 
ing due to verification, accounting and other inter-agent costs, or the risk-loaded premium 
charged by the insurer. For the constraints, we consider a general set of restrictions given in 
terms of distortion risk measures. 

Our main result, namely Theorem 3, shows that in all of the above cases, the optimal 
risk allocation consists of a collection or "ladder" of deductible contracts. This result can be 
interpreted as an economic justification for the tranche contracts one observes in practice, in 
particular, in credit and reinsurance markets. Moreover, using the quantile representation of 
distortion risk measures we are able to explicitly characterize Pareto optimal contracts under 
transaction costs and/or constraints. In turn, this allows us to present several completely 
worked-out examples of optimal risk sharing under some common risk measures, such as 
Average Value-at-Risk. 

In terms of related literature, Theorem 3 is an extension of the results of Jouini et al. [25]
to the multi-agent case with transaction costs and constraints. Compared to their abstract
approach based on convex duality and inf-convolution, our method is more elementary and
direct and provides a clearer insight into the problem structure. On a more general note, this
paper aims to underscore the usefulness of distortion risk measures that have been arguably
underappreciated by the financial/mathematical economics community [14]. In contrast to
the classical expected utility theory, this new framework is driven by two factors. First, it
postulates cash-equivariant preferences that are appealing based on the normative observation
that guaranteed cash payments should not affect risk attitudes. Secondly, distortion risk
measures attempt to mirror business practices where various Value-at-Risk (VaR) methodologies
have emerged as the tool of choice. In particular, Average Value-at-Risk (AVaR)
has been gaining practitioner acceptance and also happens to be a canonical example of our
model.

This paper is organized as follows: In Section 2, we define the setting in which the n agents 
seek a Pareto optimal risk exchange. In Section 3, we obtain the class of Pareto optimal risk 
exchanges in our model. This is then generalized to the constrained setting in Section 4. We 
focus on the case of two agents in Section 5, while interpreting one agent as an insurer and 
another as a buyer of insurance. In this simplified setup we present fully solved examples, 
including examples with explicitly computable deductibles. In Section 6, we provide another 
illustration of our results by considering a single-agent minimization by a buyer of insurance 
who faces a regulator constraint on the possible indemnity contracts. Section 7 concludes 
the paper. 

2. Model for Risk Sharing 

2.1. Distorted Probabilities. Consider the collection of a.s.-finite random variables $\mathcal{V} =
\{Y : \mathbb{P}[-\infty < Y < \infty] = 1\}$ on a probability space $(\Omega, \mathbb{P})$. As usual, we denote by $L^\infty \subset \mathcal{V}$
($L^1 \subset \mathcal{V}$) the collection of all a.s. bounded (respectively integrable) random variables.

Definition 1. Two random variables Y and Z are said to be comonotone if

$$(Y(\omega_1) - Y(\omega_2))(Z(\omega_1) - Z(\omega_2)) \ge 0, \tag{1}$$

$\mathbb{P}(d\omega_1) \times \mathbb{P}(d\omega_2)$-almost surely. In other words, Y and Z move together.

An equivalent definition of comonotonicity is that there exist a random variable $V \in \mathcal{V}$
and non-decreasing functions $f_Y$ and $f_Z$ such that $Y = f_Y(V)$ and $Z = f_Z(V)$ almost surely
[15]. Another equivalent definition is that there exist non-decreasing functions $h_Y$ and $h_Z$
such that $h_Y(x) + h_Z(x) = x$, $Y = h_Y(Y + Z)$, and $Z = h_Z(Y + Z)$ almost surely.
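
Condition (1) can be checked directly on a finite scenario space. The sketch below is only an illustration: the helper `are_comonotone` and the four scenarios are assumptions made here, and every scenario is taken to have positive probability so that "almost surely" reduces to "for every pair of scenarios". It also shows the second equivalent definition at work, with $h_Y(x) = \min(x, 2)$ and $h_Z(x) = (x - 2)_+$ summing to x.

```python
from itertools import combinations

def are_comonotone(y, z):
    """Check condition (1) on a finite scenario space: no pair of scenarios
    moves Y and Z in strictly opposite directions."""
    return all((y[i] - y[j]) * (z[i] - z[j]) >= 0
               for i, j in combinations(range(len(y)), 2))

# Hypothetical total loss in four scenarios, split by a deductible of 2.
total = [0.0, 1.0, 3.0, 7.0]
retained = [min(x, 2.0) for x in total]        # h_Y(x) = min(x, 2)
insured = [max(x - 2.0, 0.0) for x in total]   # h_Z(x) = (x - 2)+

split_is_comonotone = are_comonotone(retained, insured)
hedge_is_comonotone = are_comonotone(total, [-x for x in total])
print(split_is_comonotone, hedge_is_comonotone)
```

Both shares of the deductible split are non-decreasing functions of the total, so they pass the check, whereas a loss and its perfect hedge do not.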

Definition 2. A function $H : \mathcal{V} \to [-\infty, \infty]$ is called a law-invariant, comonotone, monetary risk
measure (or distortion risk measure) if H satisfies the following five properties:

(a) $H(Y)$ depends only on the law of $Y \in \mathcal{V}$.

(b) H is monotone in the natural order of $\mathcal{V}$.

(c) H is cash equivariant: $H(Y + a) = H(Y) + a$ for any $a \in \mathbb{R}$.

(d) H is subadditive in general and additive for comonotone risks: For $Y, Z \in \mathcal{V}$,

$$H(Y + Z) \le H(Y) + H(Z), \tag{2}$$

with equality for any Y, Z comonotone.

(e) H is continuous: For $Y \in \mathcal{V}$,

$$\lim_{d \to -\infty} H[\max(Y, d)] = H(Y), \tag{3a}$$
$$\lim_{d \to 0^+} H[\max(Y - d, 0)] = H(Y), \quad \text{if } Y \ge 0, \tag{3b}$$
$$\lim_{d \to \infty} H[\min(Y, d)] = H(Y). \tag{3c}$$

The above axioms are justified by basic economic principles as applied to insurance; see 
[14, 25, 38]. Because we are interested in risk sharing, cash equivariance is a desirable 
property because receiving fixed payments (at least within a reasonable range) should not 
affect attitudes towards risk. The comonotone additivity property represents inability to 
diversify risks that always move in the same direction. The continuity property (e) is for
technical reasons, although it was shown in [24] that, viewing H as a map on $L^\infty(\mathbb{P})$, (e) is
automatically implied by (a)-(d).

Denote by $S_Y$ the (decumulative) distribution function of Y, that is, $S_Y(t) = \mathbb{P}(Y > t)$,
and by $S_Y^{-1}$ the (pseudo-)inverse of $S_Y$, which is unique up to a countable set [15]. For
concreteness, take $S_Y^{-1}(p) = \sup\{t : S_Y(t) > p\}$. The inverse $S_Y^{-1}$ thus defined is right
continuous; if one were to desire left continuity, then replace $>$ with $\ge$.

We recall that any distortion risk measure admits the following representation, which 
essentially follows from Greco's representation theorem [21]: 

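Theorem 1. ([15], [38, Appendix A]) Let H be a distortion risk measure. Then, there exists
a non-decreasing, concave function $g : [0, 1] \to [0, 1]$ such that $g(0) = 0$, $g(1) = 1$, and

$$H(Y) = \int Y \, d(g \circ \mathbb{P}) = \int_0^1 S_Y^{-1}(p) \, dg(p) \tag{4}$$
$$\phantom{H(Y)} = \int_{-\infty}^0 \big(g[S_Y(t)] - 1\big) \, dt + \int_0^\infty g[S_Y(t)] \, dt, \quad \forall\, Y \in \mathcal{V}.$$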

We write $H_g$ for H when we want to specify the particular function g in (4). The function
g is called a distortion because it modifies, or distorts, the tail probability $S_Y$. Observe that if
$g(p) = p$, then $H_g(Y) = \mathbb{E}Y$. For this reason, $H_g$ is also referred to as an expectation with
respect to a distorted probability. Note that at this stage we allow $H_g$ to take $\pm\infty$ as a
value.

We assume that each agent orders random variables in $\mathcal{V}$ by using a distortion risk measure
$H_g$, where Y is preferred to (that is, less risky than) Z by the agent if $H_g(Y) \le H_g(Z)$, and
we pursue this topic in the next section. For more background on such risk measures $H_g$,
see Yaari [39] who discusses evaluating random variables in a theory of risk that is dual to
expected utility. Two noteworthy examples of distortion risk measures are (1) the Average
Value-at-Risk at level $1 - 1/\alpha$ (AVaR) obtained by taking $g(p) = \min(\alpha p, 1)$ for some $\alpha > 1$
and (2) the proportional hazards transform $g(p) = p^c$ for some $0 < c < 1$.
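
As a small numerical illustration of representation (4) and of the two examples just mentioned, the following Python sketch evaluates $H_g$ for a loss with finitely many outcomes. It is an illustration only: the function name `distortion_risk` and the three-point loss are assumptions made here, and the sum is the discrete form of the Choquet integral in (4), in which an outcome y receives the distorted mass $g(\mathbb{P}(Y \ge y)) - g(\mathbb{P}(Y > y))$.

```python
def distortion_risk(outcomes, probs, g):
    """H_g(Y) for a loss Y with finitely many outcomes (discrete form of (4))."""
    total = 0.0
    tail = 1.0                                  # P(Y >= y) for the current outcome y
    for y, p in sorted(zip(outcomes, probs)):   # visit outcomes in ascending order
        next_tail = max(tail - p, 0.0)          # P(Y > y), clipped against rounding
        total += y * (g(tail) - g(next_tail))   # distorted probability mass at y
        tail = next_tail
    return total

# Hypothetical loss: 0 w.p. 0.7, 10 w.p. 0.2, 100 w.p. 0.1.
outcomes, probs = [0.0, 10.0, 100.0], [0.7, 0.2, 0.1]

identity = lambda p: p                    # g(p) = p recovers the expectation
avar = lambda p: min(5.0 * p, 1.0)        # g(p) = min(alpha p, 1) with alpha = 5
ph = lambda p: p ** 0.5                   # proportional hazards with c = 1/2

mean_loss = distortion_risk(outcomes, probs, identity)
avar_loss = distortion_risk(outcomes, probs, avar)
ph_loss = distortion_risk(outcomes, probs, ph)
print(mean_loss, avar_loss, ph_loss)
```

For this loss the identity distortion returns the mean, 12, while the AVaR distortion with $\alpha = 5$ averages the worst 20% of the distribution and returns 55 (up to floating-point rounding); the proportional hazards value lies above the mean, as expected for a concave distortion.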

Remark 1. It has been shown [13, 26] that any distortion risk measure is a weighted average
of the AVaR. Namely, define $\mathrm{AVaR}_\alpha(Y)$ as above. Then, any comonotone law-invariant
coherent risk measure on $\mathcal{V}$ can be written as

$$H(Y) = \int_0^1 \mathrm{AVaR}_\alpha(Y) \, \mu(d\alpha),$$

for some probability measure $\mu$ on [0, 1]. For this reason, [13] calls a distortion risk measure
Weighted VaR.

Remark 2. Since a distortion risk measure is a special case of a coherent risk measure, one can
also obtain a representation of H in terms of penalized expectations, $H(Y) = \sup_{Q \in \mathcal{D}} \mathbb{E}_Q[Y]$,
for the set $\mathcal{D}$ of probability measures called the core of $g \circ \mathbb{P}$, and suitable Y [15, Proposition
10.3]. For more results in this direction see [18].

Definition 3. Y is said to precede (or be preferred to) Z in convex order if $\int_0^q S_Y^{-1}(p) \, dp \le
\int_0^q S_Z^{-1}(p) \, dp$ for all $q \in [0, 1]$ with equality at $q = 1$. We write $Y \le_{cx} Z$.

Note that convex order is equivalent to ordering with respect to second stochastic
dominance with equal means [31, 32, 33]. For later use, recall that $H_g$ satisfies the following
properties for $Y \in \mathcal{V}$ (see [37]):

(a) Positive homogeneity: If $a > 0$, then $H_g(aY) = a H_g(Y)$. Note that positive
homogeneity and subadditivity imply that H is convex, that is, $H[\lambda Y + (1 - \lambda) Z] \le
\lambda H(Y) + (1 - \lambda) H(Z)$ for all $\lambda \in (0, 1)$.

(b) Duality: $H_g(-Y) = -H_{\tilde g}(Y)$, in which $\tilde g$ is the dual distortion of g given by $\tilde g(p) =
1 - g(1 - p)$ for $p \in [0, 1]$. Since g is concave, $\tilde g$ is convex. The dual $H_{\tilde g}$ can be thought
of as a monetary utility function that measures attitudes towards wealth levels; see
[24].

(c) Convex ordering: Because g is concave, $H_g$ preserves $\le_{cx}$, that is, if $Y \le_{cx} Z$ then
$H_g(Y) \le H_g(Z)$. In particular, because $\mathbb{E}Y \le_{cx} Y$, we have $\mathbb{E}Y = H_g(\mathbb{E}Y) \le H_g(Y)$.

(d) Non-excessive loading: $H(Y) \le \operatorname{ess\,sup} Y$.
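
Several of the properties above can be sanity-checked numerically on a finite space of equally likely scenarios. The sketch below is an illustration under assumptions made here (the helper `H`, the square-root distortion, and the scenario vectors are not from the text); it compares both sides of cash equivariance, positive homogeneity, the comonotone case of (2), the subadditive case of (2), and the loading properties.

```python
def H(values, g):
    """H_g on a finite space of equally likely scenarios (discrete form of (4))."""
    n = len(values)
    ordered = sorted(values)                       # ascending losses
    # the (k+1)-th smallest loss carries distorted mass g((n-k)/n) - g((n-k-1)/n)
    return sum(v * (g((n - k) / n) - g((n - k - 1) / n))
               for k, v in enumerate(ordered))

g = lambda p: p ** 0.5                             # concave distortion (assumed)
Y = [1.0, 2.0, 5.0, 9.0]
Z_co = [0.0, 0.0, 3.0, 4.0]                        # non-decreasing in Y: comonotone
Z_anti = [4.0, 3.0, 0.0, 0.0]                      # moves against Y

add = lambda u, v: [a + b for a, b in zip(u, v)]
mean = lambda u: sum(u) / len(u)

cash_gap = H([y + 3.0 for y in Y], g) - (H(Y, g) + 3.0)        # cash equivariance
scale_gap = H([2.0 * y for y in Y], g) - 2.0 * H(Y, g)         # positive homogeneity
comonotone_gap = H(add(Y, Z_co), g) - (H(Y, g) + H(Z_co, g))   # equality in (2)
diversified_gap = H(add(Y, Z_anti), g) - (H(Y, g) + H(Z_anti, g))
loading = H(Y, g) - mean(Y)                                    # E Y <= H_g(Y)
print(cash_gap, scale_gap, comonotone_gap, diversified_gap, loading)
```

The first three gaps vanish up to rounding, the fourth is strictly negative because the two risks partly offset each other, and the loading is positive but keeps $H_g(Y)$ below the largest scenario loss.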

2.2. Economic Objective. Suppose agent i faces a random loss $X_i$ before any risk exchange
for $i = 1, 2, \dots, n$. If the collection of agents trades the original allocation $\mathbf{X}$ for the allocation
$\mathbf{Y} \in \mathcal{A}(X)$, then the random loss or payout, including transaction costs, of agent i becomes

$$Z_i = Y_i + (a_i + b_i Y_i + c_i \mathbb{E}Y_i) = (1 + b_i) Y_i + a_i + c_i \mathbb{E}Y_i. \tag{5}$$

The additive factor $a_i \ge 0$ is a fixed cost associated with transferring the risk to the
coalition of agents (or to a central clearing house); for example, $a_i$ could be the premium
that the agent pays to the coalition to eliminate the risk $X_i$. The multiplicative factor
$b_i \ge 0$ represents costs associated with the actual size of the random loss $Y_i$, for example,
investigative costs that could increase proportionally with the size of the loss. The factor
$c_i \in \mathbb{R}$ represents costs that reflect the expected size of the payout $Y_i$, for example, hiring
claim administrators; $c_i$ is also net of any premium that the agent receives in exchange for
accepting the risk $Y_i$, if the premium equals $(1 + \theta)\mathbb{E}Y_i$ as in [3]. In fact, we might wish to
say that $c_i = -(1 + \theta)$, that is, all of this part of the cost function arises from premium
received. We explore this in examples later in the paper, as well as at the end of this section.

Agent i, for $i = 1, 2, \dots, n$, seeks to minimize $H_{g_i}(Z_i)$ for some concave distortion function
$g_i$. Note that minimizing

$$H_{g_i}(Z_i) = H_{g_i}\big((1 + b_i) Y_i + a_i + c_i \mathbb{E}Y_i\big) = (1 + b_i) H_{g_i}(Y_i) + a_i + c_i \mathbb{E}Y_i \tag{6}$$

is equivalent to minimizing

$$V_i(Y_i) := (1 + b_i) H_{g_i}(Y_i) + c_i \mathbb{E}Y_i. \tag{7}$$

In light of this recasting of agent i's goal, a Pareto optimal risk exchange is defined as 
follows: 

Definition 4. $\mathbf{X}^* \in \mathcal{A}(X)$ is called a Pareto optimal risk exchange or allocation if whenever
there exists an allocation $\mathbf{Y} \in \mathcal{A}(X)$ such that $V_i(Y_i) \le V_i(X_i^*)$ for all $i = 1, 2, \dots, n$, then
$V_i(Y_i) = V_i(X_i^*)$ for all $i = 1, 2, \dots, n$.

In other words, there is no way to make any agent (strictly) better off without making 
another agent (strictly) worse off. 

We assume that the initial allocation carries finite risk, that is, $H_{g_i}(X_i)$ is finite for $i =
1, 2, \dots, n$. Therefore, there exists at least one allocation $\mathbf{Y}$, namely $\mathbf{X}$ itself, such that $V_i(Y_i)$
is finite for all $i = 1, 2, \dots, n$.

We end this section by discussing the rationality constraint mentioned in the Introduction.
In order that the allocation $\mathbf{Y} \in \mathcal{A}(X)$ be feasible (regardless of whether it is Pareto optimal),
it must be true that each agent is at least as well off under $\mathbf{Y}$ as under the original allocation
$\mathbf{X}$. That is, the following inequality must hold for each $i = 1, 2, \dots, n$: $H_{g_i}(X_i) \ge H_{g_i}(Z_i) = a_i + V_i(Y_i)$.
We assume that the set of feasible allocations in $\mathcal{A}(X)$ is non-empty.

When first presenting the cost function $a_i + b_i Y_i + c_i \mathbb{E}Y_i$ in connection with equation (5), we
proposed that one might wish to consider the last term as representing premium received in
exchange for accepting the risk $Y_i$. In that case, write the premium as $-c_i \mathbb{E}Y_i = (1 + \theta)\mathbb{E}Y_i$,
so that the rationality constraint becomes

$$(1 + \theta)\mathbb{E}Y_i \ge a_i + (1 + b_i) H_{g_i}(Y_i) - H_{g_i}(X_i). \tag{8}$$

One can interpret the right-hand side of inequality (8) as the minimum premium that agent
i is willing to accept for replacing $X_i$ with $Y_i$. Therefore, the rationality constraint holds in
this case if the premium received, namely the left-hand side of (8), is at least as great as this
risk-adjusted cost.

3. Pareto Optimal Allocations 

To describe the Pareto optimal allocations, we begin with a series of lemmas. In the first
lemma, we show that if the $1 + b_i + c_i$'s are of different signs or if one of them is zero and
the other is non-zero, then no Pareto optimal allocation exists.

Lemma 1. Suppose there exist $i, j = 1, 2, \dots, n$ such that $1 + b_i + c_i \ne 0$ and $(1 + b_i + c_i)
(1 + b_j + c_j) \le 0$. Then no Pareto optimal allocation in $\mathcal{A}(X)$ exists.

Proof. Without loss of generality, suppose that $1 + b_1 + c_1 < 0$ and $1 + b_2 + c_2 \ge 0$. Consider
any $\mathbf{Y} \in \mathcal{A}(X)$. Then, $\mathbf{Z} = (Y_1 + 1, Y_2 - 1, Y_3, \dots, Y_n)$ is a strict improvement on $\mathbf{Y}$ because
$V_1(Z_1) = V_1(Y_1) + (1 + b_1 + c_1) < V_1(Y_1)$ and $V_2(Z_2) = V_2(Y_2) - (1 + b_2 + c_2) \le V_2(Y_2)$. Thus,
there exists no Pareto optimal allocation in $\mathcal{A}(X)$. □
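
The mechanism in this proof is that a sure transfer $\beta$ changes $V_i$ by exactly $(1 + b_i + c_i)\beta$. The following sketch illustrates this on four equally likely scenarios; the helpers `H` and `V`, the AVaR-type distortion, and the cost parameters are hypothetical choices made here so that the two agents have coefficients of opposite sign.

```python
def H(values, g):
    """H_g on equally likely scenarios (discrete form of (4))."""
    n = len(values)
    return sum(v * (g((n - k) / n) - g((n - k - 1) / n))
               for k, v in enumerate(sorted(values)))

def V(values, g, b, c):
    """V_i(Y_i) = (1 + b_i) H_{g_i}(Y_i) + c_i E[Y_i], as in (7)."""
    return (1.0 + b) * H(values, g) + c * sum(values) / len(values)

g = lambda p: min(2.0 * p, 1.0)            # AVaR-type distortion (assumed)
agent1 = dict(b=0.0, c=-1.5)               # 1 + b + c = -0.5 < 0
agent2 = dict(b=0.1, c=0.0)                # 1 + b + c =  1.1 > 0

Y1 = [0.0, 1.0, 4.0, 6.0]
Y2 = [2.0, 2.0, 3.0, 5.0]
Z1 = [y + 1.0 for y in Y1]                 # agent 1 takes one extra unit of sure loss
Z2 = [y - 1.0 for y in Y2]                 # agent 2 sheds one unit of sure loss

change1 = V(Z1, g, **agent1) - V(Y1, g, **agent1)
change2 = V(Z2, g, **agent2) - V(Y2, g, **agent2)
print(change1, change2)
```

Both changes are negative (about -0.5 and -1.1), so the transfer can be repeated indefinitely and no allocation is Pareto optimal, which is the content of Lemma 1.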

For the present, we skip the case in which $1 + b_i + c_i = 0$ for all $i = 1, 2, \dots, n$; we consider
it more fully for the case of n = 2 in Section 5. The next lemma is straightforward, but we 
include its proof for completeness. 

Lemma 2. If $\mathbf{X}^* = (X_1^*, X_2^*, \dots, X_n^*) \in \mathcal{A}(X)$ is Pareto optimal, then so is $(X_1^*, \dots, X_j^* +
\beta, \dots, X_k^* - \beta, \dots, X_n^*) \in \mathcal{A}(X)$ for any $\beta \in \mathbb{R}$ and any $j, k = 1, 2, \dots, n$.

Proof. Let $\mathbf{X}^* = (X_1^*, X_2^*, \dots, X_n^*) \in \mathcal{A}(X)$ be Pareto optimal. Suppose $\mathbf{Y} \in \mathcal{A}(X)$ is such
that $V_j(Y_j) \le V_j(X_j^* + \beta)$, $V_k(Y_k) \le V_k(X_k^* - \beta)$, and $V_i(Y_i) \le V_i(X_i^*)$ for $i \ne j, k$. We
want to show that equality holds in each case. Inequality $V_j(Y_j) \le V_j(X_j^* + \beta)$ implies that
$V_j(Y_j) \le V_j(X_j^*) + (1 + b_j + c_j)\beta$, from which it follows that $V_j(Y_j - \beta) \le V_j(X_j^*)$. Similarly,
$V_k(Y_k) \le V_k(X_k^* - \beta)$ implies that $V_k(Y_k + \beta) \le V_k(X_k^*)$. Note that the allocation $\mathbf{Y}'$ defined
by $Y_j' = Y_j - \beta$, $Y_k' = Y_k + \beta$, and $Y_i' = Y_i$ for $i \ne j, k$ is in $\mathcal{A}(X)$. Therefore, by the Pareto
optimality of $\mathbf{X}^*$ we have $V_j(Y_j - \beta) = V_j(X_j^*)$, $V_k(Y_k + \beta) = V_k(X_k^*)$, and $V_i(Y_i) = V_i(X_i^*)$
for $i \ne j, k$, from which it follows that $V_j(Y_j) = V_j(X_j^* + \beta)$, $V_k(Y_k) = V_k(X_k^* - \beta)$, and
$V_i(Y_i) = V_i(X_i^*)$ for $i \ne j, k$. Hence, $(X_1^*, X_2^*, \dots, X_j^* + \beta, \dots, X_k^* - \beta, \dots, X_n^*)$ is Pareto
optimal. □

It follows from Lemma 2 that without loss of generality, we can assume that a Pareto
optimal allocation assigns the loss 0 to each of the n agents when the total loss X is 0.
If this particular Pareto optimal allocation does not satisfy the rationality constraint in 
inequality (8), then we can modify the allocation by constants (that sum to zero) so that the 
rationality constraint is satisfied. (Recall that we assume that the set of feasible allocations 
is non-empty, so there exist such constants.) 



OPTIMAL RISK SHARING UNDER DISTORTED PROBABILITIES 






Consider the mapping $F : \mathcal{A}(X) \to \mathbb{R}^n$ given by $F(\mathbf{Y}) = (V_1(Y_1), V_2(Y_2), \dots, V_n(Y_n))$. We
can partially order the points in $\mathbb{R}^n$ as follows:

Definition 5. For $x, y \in \mathbb{R}^n$, we write $x \le y$ if $x_i \le y_i$ for $i = 1, 2, \dots, n$.

The next lemma, whose proof is immediate from the definition of Pareto optimality in
Definition 4, shows that the Pareto optimal points in $\mathcal{A}(X)$ correspond to the minimal
points in the image of F in $\mathbb{R}^n$.

Lemma 3. If $\mathbf{X}^* \in \mathcal{A}(X)$ is Pareto optimal, then $F(\mathbf{X}^*) \in \operatorname{im}(F)$ is minimal. Conversely,
if $x \in \operatorname{im}(F)$ is minimal, then there exists $\mathbf{X}^* \in \mathcal{A}(X)$ with $F(\mathbf{X}^*) = x$, such that $\mathbf{X}^*$ is a
Pareto optimal allocation.

We next use Lemmas 2 and 3 to characterize the set of Pareto optimal allocations when
we view them as points in $\mathbb{R}^n$ via the mapping F.

Theorem 2. Suppose $(1 + b_i + c_i)(1 + b_j + c_j) > 0$ for all $i, j = 1, 2, \dots, n$. Then, the image
of the set of Pareto optimal allocations in $\mathcal{A}(X)$ under the mapping F is a hyperplane in $\mathbb{R}^n$
given by

$$\sum_{i=1}^n \frac{x_i - V_i(X_i^*)}{1 + b_i + c_i} = 0, \tag{9}$$

in which $\mathbf{X}^* \in \mathcal{A}(X)$ is any Pareto optimal allocation. Furthermore, one obtains such a
Pareto optimal allocation $\mathbf{X}^*$ by minimizing

$$\sum_{i=1}^n \frac{V_i(Y_i)}{|1 + b_i + c_i|} \tag{10}$$

over $\mathbf{Y} \in \mathcal{A}(X)$.

Proof. We begin by showing that if $\mathbf{X}^* \in \mathcal{A}(X)$ minimizes the expression in (10), then $\mathbf{X}^*$
is Pareto optimal. Suppose that $\mathbf{Y} \in \mathcal{A}(X)$ is such that $V_i(Y_i) \le V_i(X_i^*)$ for $i = 1, 2, \dots, n$.
Then, $\sum_{i=1}^n V_i(Y_i)/|1 + b_i + c_i| \le \sum_{i=1}^n V_i(X_i^*)/|1 + b_i + c_i|$, from which it follows that
$\sum_{i=1}^n V_i(Y_i)/|1 + b_i + c_i| = \sum_{i=1}^n V_i(X_i^*)/|1 + b_i + c_i|$ because $\mathbf{X}^*$ minimizes (10). Since
each term satisfies $V_i(Y_i) \le V_i(X_i^*)$, it follows that
$V_i(Y_i) = V_i(X_i^*)$ for $i = 1, 2, \dots, n$, and $\mathbf{X}^*$ is Pareto optimal.

Next, suppose $x \in \mathbb{R}^n$ satisfies the equation of the hyperplane (9) for some Pareto optimal
allocation $\mathbf{X}^* \in \mathcal{A}(X)$. Define $\beta_i := (x_i - V_i(X_i^*))/(1 + b_i + c_i)$ for $i = 1, 2, \dots, n$; then,
$\sum_{i=1}^n \beta_i = 0$. Define $\tilde{\mathbf{X}}^* := (X_1^* + \beta_1, X_2^* + \beta_2, \dots, X_n^* + \beta_n) \in \mathcal{A}(X)$. By the same argument
as in the proof of Lemma 2, one can show that $\tilde{\mathbf{X}}^*$ is Pareto optimal. Finally,

$$F(\tilde{\mathbf{X}}^*) = (V_1(X_1^* + \beta_1), V_2(X_2^* + \beta_2), \dots, V_n(X_n^* + \beta_n))$$
$$= F(\mathbf{X}^*) + (\beta_1(1 + b_1 + c_1), \beta_2(1 + b_2 + c_2), \dots, \beta_n(1 + b_n + c_n))$$
$$= F(\mathbf{X}^*) + (x_1 - V_1(X_1^*), x_2 - V_2(X_2^*), \dots, x_n - V_n(X_n^*)) = x.$$

Thus, (any) x in (9)
is an image of a Pareto optimal allocation in $\mathcal{A}(X)$ via the mapping F. As an aside, note
that all elements of the hyperplane (9) give the same minimum value in the expression (10).

To complete the proof, we need to show that the hyperplane (9) gives us all the Pareto
optimal allocations. Suppose not; suppose that there is a Pareto optimal allocation $\mathbf{Y}^* \in
\mathcal{A}(X)$ that is mapped to a point not on the hyperplane (9). Then, by the argument in
the above paragraph, any point $y \in \mathbb{R}^n$ that satisfies $\sum_{i=1}^n (V_i(Y_i^*) - y_i)/(1 + b_i + c_i) = 0$
is the image of a Pareto optimal allocation. Thus, we have two parallel hyperplanes both
purporting to be the image (under the mapping F) of Pareto optimal allocations in $\mathcal{A}(X)$.
By Lemma 3, only one of these hyperplanes will be minimal, a contradiction. Thus, the
Pareto optimal allocations in $\mathcal{A}(X)$ correspond to points in the hyperplane (9). □





To describe Pareto optimal allocations corresponding to points in the hyperplane (9), it
is easier to consider comonotone allocations.

Definition 6. An allocation $\mathbf{Y} \in \mathcal{A}(X)$ is called comonotone if $Y_i$ and X are comonotone
for $i = 1, 2, \dots, n$.

Note that if $\mathbf{Y}$ is a comonotone allocation then any two $Y_i$ and $Y_j$ are also pairwise
comonotone. Ludkovski and Rüschendorf [28, Proposition 1] show that for $V_i$ preserving the
convex order, any integrable non-comonotone allocation $\mathbf{X} \in \mathcal{A}(X)$, $X_i \in L^1(\mathbb{P})$, is dominated
by some comonotone $\mathbf{X}^*$, that is, $V_i(X_i^*) \le V_i(X_i)$ for $i = 1, 2, \dots, n$. This result is essentially based
on the comonotone $\le_{cx}$-improvement result of Landsberger and Meilijson [27]. Note that
the requirement $X_i \in L^1$ is automatically satisfied since we already assume that $\mathbb{E}X_i \le
H_{g_i}(X_i) < \infty$. Thus, Pareto optimal allocations are comonotone.

For a comonotone allocation $\mathbf{X} = (f_1(X), f_2(X), \dots, f_n(X))$, Denneberg [15, Proposition
4.5] shows that the functions $f_i$ are continuous on supp(X) for $i = 1, 2, \dots, n$. Moreover,
he shows that $f_i$ may be extended to continuous functions on the entire real line such that
$\sum_{i=1}^n f_i(x) = x$ for $x \in \mathbb{R}$. It follows that we can restrict our attention to finding Pareto
optimal allocations in

$$\mathcal{C}(X) := \Big\{ (f_1(X), \dots, f_n(X)) \in \mathcal{A}(X) : f_i \text{ cont., non-decreasing}, \ \sum_{i=1}^n f_i(x) = x \text{ for } x \in \mathbb{R} \Big\}. \tag{11}$$

i=l 

Comonotonicity implies that an optimal risk allocation necessarily satisfies the mutuality 
principle, whereby the share of each agent depends only on the total risk X. We now use 
the above results to explicitly characterize the Pareto optimal allocations.

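Theorem 3. Suppose $(1 + b_i + c_i)(1 + b_j + c_j) > 0$ for all $i, j = 1, 2, \dots, n$. Then, $\mathbf{X}^* =
(f_1^*(X), f_2^*(X), \dots, f_n^*(X)) \in \mathcal{C}(X)$ is a Pareto optimal allocation if and only if

$$\sum_{i \in I} (f_i^*)'(t) = 1 \quad \text{for } I = \operatorname*{argmin}_{k = 1, 2, \dots, n} \frac{(1 + b_k) g_k(S_X(t)) + c_k S_X(t)}{|1 + b_k + c_k|}, \tag{12}$$

and $(f_i^*)'(t) = 0$ otherwise.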

Proof. From Theorem 2 and [28], we know that Pareto optimal allocations correspond to
minimizers $\mathbf{X}^* \in \mathcal{C}(X)$ of the expression in (10). As discussed after the proof of Lemma 2, without
loss of generality, suppose that the Pareto optimal allocation $\mathbf{X}^* = (f_1^*(X), f_2^*(X), \dots, f_n^*(X))$
is such that $f_i^*(0) = 0$ for $i = 1, 2, \dots, n$.

Suppose $Y = f(X)$ for a continuous, non-decreasing real-valued function f on $\mathbb{R}_+$ with
$f(0) = 0$; then,

$$\begin{aligned}
(1 + b) H_g(Y) + c\,\mathbb{E}Y &= (1 + b) \int_0^1 S_Y^{-1}(p) \, dg(p) + c \int_0^1 S_Y^{-1}(p) \, dp \\
&= (1 + b) \int_0^1 f[S_X^{-1}(p)] \, dg(p) + c \int_0^1 f[S_X^{-1}(p)] \, dp \\
&= (1 + b) \int_0^\infty g[S_X(t)] \, df(t) + c \int_0^\infty S_X(t) \, df(t) \\
&= \int_0^\infty [(1 + b) g + c](S_X(t)) \, df(t),
\end{aligned} \tag{13}$$

in which the function $(1 + b) g + c$ is defined on [0, 1] by $[(1 + b) g + c](p) = (1 + b) g(p) + c p$.
Thus, minimizing expression (10) is equivalent to minimizing

$$\sum_{i=1}^n \int_0^\infty \frac{[(1 + b_i) g_i + c_i](S_X(t))}{|1 + b_i + c_i|} \, df_i(t), \tag{14}$$

which is minimized by setting $\sum_{i \in I} (f_i^*)'(t) = 1$ for
$I = \operatorname{argmin}_{k = 1, 2, \dots, n} \{(1 + b_k) g_k(S_X(t)) + c_k S_X(t)\}/|1 + b_k + c_k|$, and by setting $(f_i^*)'(t) = 0$
otherwise. □

The above theorem implies that under a Pareto optimal allocation, the risk sharing consists
of "tranches" where the risk of each tranche is entirely borne by one agent (ignoring equality
in the argmin). As expression (12) shows, the optimal allocation $Y_i^*$ of the i-th agent consists
of a series of laddered European options on the total risk X. Hence, agent i assumes total
responsibility for risk levels where $(f_i^*)'(t) = 1$, and receives full insurance otherwise. Such
risk sharing arrangements are observed in practice in credit derivatives, where the total risk 
X represents a bond portfolio subject to default risk and the corresponding risk is allocated 
via credit tranches. These credit tranches can be viewed as optimal insurance contracts for 
a set of representative investors with varying risk measures. 
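
The tranche structure described by (12) can be sketched numerically. The example below is a hedged illustration, not the paper's own computation: the function names, the exponential total loss with $S_X(t) = e^{-t}$, the two cost-free agents (one with the AVaR-type distortion $\min(4p, 1)$, one with the proportional hazards distortion $\sqrt{p}$), the Riemann step, and the rule that ties go to the lowest-indexed agent are all assumptions made here. Each thin layer $[t, t + dt]$ of the total loss is assigned to the agent with the smallest normalized marginal cost in (12).

```python
import math

def tranche_owner(p, agents):
    """Index of the agent with the smallest normalized marginal cost in (12)
    at tail level p = S_X(t); ties go to the lowest index (an assumption)."""
    def score(agent):
        g, b, c = agent
        return ((1 + b) * g(p) + c * p) / abs(1 + b + c)
    return min(range(len(agents)), key=lambda k: score(agents[k]))

def allocate(x, survival, agents, step=1e-3):
    """Shares f_i(x), obtained by summing layers on [0, x] with (f_i)'(t) in {0, 1}."""
    shares = [0.0] * len(agents)
    for m in range(int(round(x / step))):
        t = (m + 0.5) * step                      # midpoint of the layer [t, t + dt]
        shares[tranche_owner(survival(t), agents)] += step
    return shares

survival = lambda t: math.exp(-t)                 # X ~ Exponential(1), assumed
agents = [
    (lambda p: min(4.0 * p, 1.0), 0.0, 0.0),      # agent 0: AVaR-type, no costs
    (lambda p: math.sqrt(p), 0.0, 0.0),           # agent 1: proportional hazards
]

small_loss = allocate(1.0, survival, agents)
large_loss = allocate(5.0, survival, agents)
print(small_loss, large_loss)
```

Under these assumptions the proportional hazards agent bears every layer below the level d with $S_X(d) = 1/16$, that is $d = \ln 16 \approx 2.77$, and the AVaR-type agent bears the excess $(X - d)_+$, so the two-agent ladder collapses to a single deductible contract.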

Remark 3. The problem considered in this section has a long history in the context of 
reinsurers determining the best way to allocate risk among them. Borch [7] shows that 
if the reinsurers seek to maximize their expected utility of wealth, then the allocation is 
related to their absolute risk aversions, in which the absolute risk aversion associated with a 
utility function u is $-u''/u'$. Bühlmann [8, 9] extends Borch's work by developing premium
rules associated with such risk sharing. The connection between second order stochastic 
dominance and optimality of deductible insurance was already noted in [23] and [20]. 

Remark 4. Theorem 2 and the reduction to comonotone allocations are key steps in our 
argument since they dramatically simplify the structure of Pareto optimal allocations. Note 
that the only property used in the proof of Theorem 2 was the cash equivariance of the 
corresponding risk measures, while the only property used in relation to the comonotonicity 
improvement of Proposition 1 in Ludkovski and Rüschendorf [28] was consistency of H and
$\le_{cx}$. On the other hand, Bäuerle and Müller [5] show that any law-invariant convex risk
measure, subject to a mild continuity requirement, is consistent with the convex order $\le_{cx}$.
We, therefore, hypothesize that the conclusion of Theorem 3 will hold for arbitrary law- 
invariant convex risk measures. This conjecture would further extend the setting of Jouini 
et al. [25]. 



4. Constrained Risk Sharing 

We next consider the related situation for which the risk sharing is subject to regulation. 
This may arise, for example, in an insurance setting where the risk transfer from buyer to 
insurer is controlled by a government regulator, or in a financial setting where the party 
taking on risk is subject to a risk management framework, such as Basel II. 

The effect of such regulation is to impose further constraints upon some of the $Y_i$'s in (10).
This of course modifies the resulting Pareto optimal allocations since some of the possible
optima become infeasible under the constraint. A similar model was studied by Bernard and
Tian [6] under the assumption of a VaR constraint. In our framework where we work with
distortion risk measures, we instead postulate constraints of the form

$$H_{h_i}(Y_i) \le B_i, \quad i = 1, 2, \dots, n,$$

in which $H_{h_i}$ is the regulator's (convex) risk measure on the final risk transfer amount $Y_i$,
and $B_i$ is the corresponding risk threshold for agent i.

We modify the set of allocations $\mathcal{A}(X)$ to account for these constraints. Define the set of
constrained allocations by

$$\mathcal{A}^c(X) := \Big\{ \mathbf{Y} := (Y_1, Y_2, \dots, Y_n) : X = \sum_{i=1}^n Y_i, \ V_i(Y_i) \text{ finite}, \ H_{h_i}(Y_i) \le B_i \Big\}.$$

We assume that the set of feasible allocations in $\mathcal{A}^c(X)$ is non-empty. Analogous to
Definition 4, $\mathbf{X}^* \in \mathcal{A}^c(X)$ is a constrained Pareto optimal allocation if whenever there exists an
allocation $\mathbf{Y} \in \mathcal{A}^c(X)$ such that $V_i(Y_i) \le V_i(X_i^*)$ for all $i = 1, 2, \dots, n$, then $V_i(Y_i) = V_i(X_i^*)$
for all $i = 1, 2, \dots, n$.

The next lemma shows that as in Section 3 for unconstrained Pareto optimal allocations, 
without loss of generality we can restrict our attention to constrained Pareto optimal allo- 
cations that are comonotone. 

Lemma 4. If $\mathbf{Y} \in \mathcal{A}^c(X)$, then there exists $\mathbf{Y}' \in \mathcal{C}(X) \cap \mathcal{A}^c(X)$ that improves it in the
partial ordering of Section 3.

Proof. Ludkovski and Rüschendorf [28, Proposition 1] show that given an arbitrary allocation
$\mathbf{Y} \in \mathcal{A}^c(X) \subset \mathcal{A}(X)$, there is a comonotone improvement in the stochastic convex order
$\mathbf{Y}' \in \mathcal{C}(X)$, that is, $Y_i' \le_{cx} Y_i$ for $i = 1, 2, \dots, n$. Therefore, the allocation $\mathbf{Y}'$ improves $\mathbf{Y}$
in the partial ordering of Section 3 because $V_i$ preserves the convex order for $i = 1, 2, \dots, n$.
Moreover, because $H_{h_i}$ also preserves the convex order, it follows that $H_{h_i}(Y_i') \le H_{h_i}(Y_i) \le B_i$ for
$i = 1, 2, \dots, n$, so $\mathbf{Y}'$ is still feasible. Thus, $\mathbf{Y}' \in \mathcal{C}(X) \cap \mathcal{A}^c(X)$. □

Note that if the constraining risk measure is not convex, then optimal allocations might
not be comonotone. For instance, a VaR constraint at level $\alpha$ corresponds to the non-concave
distortion function $h(p) = \mathbf{1}_{\{p > \alpha\}}$. Such $H_h$ is not consistent with the $\le_{cx}$-order,
and therefore Lemma 4 does not apply. Indeed, as explicitly shown by Bernard and Tian [6],
the resulting optimal allocation might fail to be comonotone.
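
The failure of concavity for the VaR distortion is easy to see numerically. The following check is a rough sketch: the level $\alpha = 0.05$, the grid, and the helper `is_midpoint_concave` are assumptions made here, and midpoint concavity on a grid is only a necessary condition for concavity, so a pass is indicative while a failure is conclusive. For comparison it also tests the AVaR-type distortion $\min(p/\alpha, 1)$.

```python
def is_midpoint_concave(h, grid, tol=1e-12):
    """True if h((p + q) / 2) >= (h(p) + h(q)) / 2 for all grid pairs (up to tol)."""
    return all(h((p + q) / 2) >= (h(p) + h(q)) / 2 - tol
               for p in grid for q in grid)

alpha = 0.05                                            # illustrative level
var_distortion = lambda p: 1.0 if p > alpha else 0.0    # h(p) = 1{p > alpha}
avar_distortion = lambda p: min(p / alpha, 1.0)         # concave AVaR-type distortion

grid = [k / 200 for k in range(201)]
var_ok = is_midpoint_concave(var_distortion, grid)
avar_ok = is_midpoint_concave(avar_distortion, grid)
print(var_ok, avar_ok)
```

The VaR distortion fails already for the pair p = 0, q = 0.06, whose midpoint 0.03 has distorted value 0 while the average of the endpoint values is 1/2.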

By using Lemma 4, we reduce the constrained problem to the same situation as in Theorem 3.

Theorem 4. The optimal risk allocation for the constrained problem is obtained by finding
minimizers of

$$\sum_{i=1}^n \int_0^\infty \frac{[(1 + b_i) g_i + c_i + \lambda_i h_i](S_X(t))}{|1 + b_i + c_i + \lambda_i|} \, df_i(t), \tag{15}$$

in which $\lambda_i \ge 0$ is a Lagrange multiplier for the i-th constraint, for $i = 1, 2, \dots, n$.

It follows from Theorem 4 that we again obtain a ladder-like optimal contract structure,
similar to the tranches in (14). Many cases are possible with respect to which of the $\lambda_i$'s
are positive (that is, the respective constraint binds) versus zero. In particular, a variety of
degeneracies might arise if several constraints bind simultaneously. Instead of considering
all these cases for an arbitrary n, in Sections 5.3 and 6 we focus on a simple example with
two agents and one constraint, a setting already taken up in [6].

5. The Special Case of n = 2 Agents

In this section, we specialize our results to the case of two agents. Suppose
an individual (agent 2) is facing an insurable random loss $X_2 = X$ and wants to buy insurance
$f(X)$ for all or part of the loss $X$ from an insurer (agent 1 with $X_1 = 0$). In this case, our
problem amounts to finding a Pareto optimal allocation $(f^*(X), X - f^*(X))$, in which $f^*(X)$
is the insurer's share of the risk $X$, and $X - f^*(X)$ is the amount of the risk retained by the
individual. Arrow [3] showed that if the premium equals $(1+\theta)\mathbb{E}f(X)$ with $\theta > 0$ and if the
individual seeks to maximize his or her expected utility of wealth, then $f^*(X)$ is deductible
coverage (that is, $f^*(X)$ is given functionally by $f^*(x) = (x - d)_+$ for some $d \ge 0$, in which $x$
is a specific value of the random loss $X$). One could view this risk exchange as Pareto optimal
if the insurer's goal were to maximize its expected profits (among other possible criteria).
For more recent work in the area of optimal insurance, see Promislow and Young [29], who
extended the work of Arrow to other premium rules and optimality criteria.

We first examine the case for which $1 + b_1 + c_1 = 0 = 1 + b_2 + c_2$. Then, we consider the
case for which $(1 + b_1 + c_1)(1 + b_2 + c_2) > 0$.

5.1. $1 + b_1 + c_1 = 0 = 1 + b_2 + c_2$. In this case, we have $c_1 = -(1+b_1)$ and $c_2 = -(1+b_2)$. It
follows from arguments similar to those in Section 3 that the Pareto optimal risk exchanges
are given as the minimizers over $Y \in \mathcal{C}(X)$ of the following expression as $\lambda_1$ and $\lambda_2$ range
over the non-negative reals:

$$\lambda_1 (1+b_1)\,[H_{g_1}(Y_1) - \mathbb{E}Y_1] + \lambda_2 (1+b_2)\,[H_{g_2}(Y_2) - \mathbb{E}Y_2], \tag{16}$$

with at least one of $\lambda_1$ and $\lambda_2$ strictly positive. Without loss of generality, suppose $\lambda_1 > 0$.
Also, note that $Y_2 = X - Y_1$, and let $f(X)$ denote $Y_1$. Then, the Pareto optimal risk exchanges
are the minimizers over real-valued, continuous, non-decreasing functions $f$, with $H_{g_i}(f(X))$
finite for $i = 1, 2$, of the following expression as $\delta$ ranges over the non-negative reals:

$$[H_{g_1}(f(X)) - \mathbb{E}f(X)] + \delta\,[\mathbb{E}f(X) - H_{g_2}(f(X))]. \tag{17}$$

Without loss of generality, we can assume that $f(0) = 0$; otherwise, define $\tilde f$ by $\tilde f(x) = f(x) - f(0)$
and note that $H_{g_i}(\tilde f(X)) - \mathbb{E}\tilde f(X) = H_{g_i}(f(X) - f(0)) - \mathbb{E}(f(X) - f(0)) = H_{g_i}(f(X)) - \mathbb{E}f(X)$
for $i = 1, 2$.

By following the argument of Theorem 3, the derivative of the optimal function $f^*$ is given by

$$(f^*)'(t) = \begin{cases} 1, & \text{if } g_1(S_X(t)) - S_X(t) < \delta\,[g_2(S_X(t)) - S_X(t)]; \\ \beta, & \text{if } g_1(S_X(t)) - S_X(t) = \delta\,[g_2(S_X(t)) - S_X(t)]; \\ 0, & \text{otherwise}, \end{cases} \tag{18}$$

in which $\beta \in [0, 1]$ is arbitrary. If we interpret $g(S_X(t)) - S_X(t)$ as the marginal cost of
adding more risk (except for the factor of $1 + b$), then $f^*$ increases if the marginal cost for
the insurer is less than the marginal cost for the buyer adjusted by the factor $\delta \ge 0$.

Note that if $0 \le \delta_1 < \delta_2$, then $f^*_{\delta_1} \le f^*_{\delta_2}$, in which $f^*_{\delta_i}$ corresponds to the minimizer of (17)
for $\delta = \delta_i$, $i = 1, 2$. In other words, as the weight given to the buyer's risk preference increases,
the insurer assumes more of the risk.
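The slope rule (18) can be tried out numerically. The following is a minimal sketch, not the authors' code: the function name `optimal_indemnity`, the grid, the Exp(1) loss, and the two proportional hazards distortions are all illustrative assumptions. It integrates the slope on a grid and compares the resulting indemnities for two values of $\delta$.

```python
import math

def optimal_indemnity(g1, g2, survival, delta, t_grid, beta=0.0):
    """Integrate the slope rule (18) over t_grid and return f*(t) on the grid.

    beta is the arbitrary slope in [0, 1] used where the two sides are equal.
    """
    values = [0.0]                                   # normalization f*(0) = 0
    for t_left, t_right in zip(t_grid, t_grid[1:]):
        p = survival(0.5 * (t_left + t_right))       # S_X at the cell midpoint
        insurer_cost = g1(p) - p
        buyer_cost = delta * (g2(p) - p)
        if insurer_cost < buyer_cost:
            slope = 1.0
        elif insurer_cost == buyer_cost:
            slope = beta
        else:
            slope = 0.0
        values.append(values[-1] + slope * (t_right - t_left))
    return values

# Hypothetical inputs: proportional hazards distortions and an Exp(1) loss.
g1 = lambda p: p ** 0.8
g2 = lambda p: p ** 0.5
survival = lambda t: math.exp(-t)
t_grid = [0.01 * k for k in range(801)]              # t in [0, 8]

f_small = optimal_indemnity(g1, g2, survival, 0.3, t_grid)
f_large = optimal_indemnity(g1, g2, survival, 0.6, t_grid)
```

With these inputs the indemnity for the larger $\delta$ dominates the one for the smaller $\delta$ pointwise on the grid, in line with the monotonicity noted above.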

In the special case for which $\delta = 0$, we seek to minimize $H_{g_1}(f(X)) - \mathbb{E}f(X)$, which is
greater than or equal to 0 because $g_1$ is concave. Thus, $H_{g_1}(f(X)) - \mathbb{E}f(X)$ is minimized
by $f^* \equiv r$ for any constant $r$. If $g_1$ is strictly concave, then this expression is minimized
uniquely (up to an additive constant) by $f^* \equiv 0$. If $g_1$ is not strictly concave, then for
illustrative purposes, suppose $g_1$ is given by AVaR, specifically $g_1(p) = \min(ap, 1)$ for some
$a > 1$. Then, for $X \sim \mathrm{Bernoulli}(q)$ for some $q > 1/a$, the function $f$ given by $f(0) = r$ and
$f(1) = r + 1$ is such that $H_{g_1}(f(X)) - \mathbb{E}f(X) = 0$ for any $r \in \mathbb{R}$. That is, if $g_1$ is not strictly
concave, then the minimizer of $H_{g_1}(f(X)) - \mathbb{E}f(X)$ is not necessarily unique.

In general, if the distortions are not strictly concave, then it is possible that
$g_1(S_X(t)) - S_X(t) = \delta\,[g_2(S_X(t)) - S_X(t)]$ on a set of positive measure, in which case $f^*$ will not be
unique.

We leave the case for which $1 + b_1 + c_1 = 0 = 1 + b_2 + c_2$ at this point because, as the reader will see in
the next section, the conclusions that we could draw further from equation (18) are similar
to the ones we will draw from equation (21) below.

5.2. $(1 + b_1 + c_1)(1 + b_2 + c_2) > 0$. Let $f(X)$ be the random indemnity that the insurer (agent
1) pays to the buyer (agent 2) in exchange for a premium of $(1+\theta)\mathbb{E}f(X)$ for some $\theta > 0$,
with $f(X)$ and $X - f(X)$ comonotone.

For concreteness, in the notation of this paper, set $a_1 = 0$, $b_1 > 0$, $c_1 = -(1+\theta)$ and
$a_2 = (1+\theta)\mathbb{E}X$, $b_2 = 0$, $c_2 = -(1+\theta)$. Thus, the condition $(1 + b_1 + c_1)(1 + b_2 + c_2) > 0$ is
equivalent to $b_1 < \theta$.

Under these values for the parameters, the rationality constraint for the insurer in (8)
becomes

$$(1+\theta)\,\mathbb{E}f(X) \ge (1+b_1)\,H_{g_1}(f(X)); \tag{19}$$

that is, the insurer is willing to enter into a contract for which the premium $(1+\theta)\mathbb{E}f(X)$ is
at least as great as the risk-adjusted cost, as measured by $(1+b_1)H_{g_1}(f(X))$. The rationality
constraint for the buyer becomes

$$H_{g_2}(f(X)) \ge (1+\theta)\,\mathbb{E}f(X); \tag{20}$$

that is, the risk-adjusted benefit for the buyer from receiving $f(X)$ is at least as great as the cost
$(1+\theta)\mathbb{E}f(X)$.

It is reasonable to assume that the buyer is "more risk averse" than the insurer in the
sense that the buyer's distortion function is a concave transformation of the insurer's, or
equivalently, $g_2 \ge g_1$. Theorem 3 then implies that the derivative of the optimal function $f^*$ is given by

$$(f^*)'(t) = \begin{cases} 1, & \text{if } g_1(S_X(t)) - S_X(t) < \frac{\theta - b_1}{\theta(1+b_1)}\,[g_2(S_X(t)) - S_X(t)]; \\ \beta, & \text{if } g_1(S_X(t)) - S_X(t) = \frac{\theta - b_1}{\theta(1+b_1)}\,[g_2(S_X(t)) - S_X(t)]; \\ 0, & \text{otherwise}, \end{cases} \tag{21}$$

in which $\beta \in [0, 1]$ is arbitrary. The function $f^*$ in equation (21) is similar in form to the one
given in (18), with the arbitrary $\delta \ge 0$ replaced by the fixed $0 < (\theta - b_1)/(\theta(1+b_1)) < 1$.
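The fixed factor $(\theta - b_1)/(\theta(1+b_1))$ can be checked by hand. The following is a sketch rather than the authors' derivation: it assumes that the optimal slope equals 1 exactly where $Q_1(p) < Q_2(p)$, with $Q_1$ and $Q_2$ the two risk functions written out in Section 5.3, the multiplier set to $\lambda = 0$, $p = S_X(t)$, and $b_1 < \theta$.

```latex
\begin{align*}
Q_1(p) < Q_2(p)
&\iff \frac{(1+b_1)\,g_1(p) - (1+\theta)\,p}{\theta - b_1}
      < \frac{g_2(p) - (1+\theta)\,p}{\theta} \\
&\iff \theta(1+b_1)\,g_1(p) - (\theta - b_1)\,g_2(p) < b_1(1+\theta)\,p \\
&\iff \theta(1+b_1)\,[g_1(p) - p] < (\theta - b_1)\,[g_2(p) - p] \\
&\iff g_1(p) - p < \frac{\theta - b_1}{\theta(1+b_1)}\,[g_2(p) - p].
\end{align*}
```

The second line multiplies through by $\theta(\theta - b_1) > 0$; the third uses $\theta(1+b_1) - (\theta - b_1) = b_1(1+\theta)$.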

From the expression in (21), we can deduce several conclusions. Because $(\theta - b_1)/\theta$ increases
as $\theta$ increases, the optimal insurance $f^*$ increases as the proportional risk loading $\theta$ increases.
Also, because $(\theta - b_1)/(1 + b_1)$ decreases as $b_1 < \theta$ increases, the optimal insurance $f^*$ decreases
as the insurer's cost $b_1$ increases. This makes sense because if the proportional cost of the
insurer increases, as measured by $b_1$, then the insurer is willing to sell less insurance to the
buyer.

If $g_2$ is replaced by a concave distortion $\tilde g_2 \ge g_2$, then $f^*$ increases because $g_2(S_X(t)) - S_X(t)$
increases. In other words, as the buyer of insurance becomes more risk averse, the buyer
is willing to purchase more insurance at a given price.
We have the following proposition that tells us when the optimal insurance is deductible
insurance. We omit its proof because it is a straightforward application of the expression in
(21). Recall from the discussion following Lemma 2 that without loss of generality, we can
assume that $f^*(0) = 0$, and we do so in this proposition.

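Proposition 1. If $(g_1(p) - p)/(g_2(p) - p)$ increases for $p \in (0, 1)$, then deductible insurance
is optimal, that is,

$$f^*(x) = (x - d)_+ \tag{22}$$

is optimal with the deductible $d$ given by

$$d = \inf\left\{ t \ge 0 : \frac{g_1(S_X(t)) - S_X(t)}{g_2(S_X(t)) - S_X(t)} < \frac{\theta - b_1}{\theta(1+b_1)} \right\}. \tag{23}$$

If no such $d$ exists, then $f^* \equiv 0$.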

Note that if $(g_1(p) - p)/(g_2(p) - p)$ increases for $p \in (0, 1)$, then
$(g_1(S_X(t)) - S_X(t))/(g_2(S_X(t)) - S_X(t))$ decreases for $t \ge 0$.

Proposition 1 is a generalization of Proposition 3.2 in Jouini et al. [25] who also obtained 
deductible insurance in the context of law-invariant convex risk measures. In contrast to 
the proof presented here, their non-constructive method relies on convex duality and only 
applies in the setting of $L^\infty(\mathbb{P})$.

We have three corollaries to Proposition 1 for special cases of distortion functions. First, 
we consider the proportional hazards transform; then, we consider AVaR; finally, we consider 
the dual power distortion. We omit their proofs because they follow directly from showing
that $(g_1(p) - p)/(g_2(p) - p)$ increases on $(0, 1)$.

Corollary 1. If $g_i(p) = p^{c_i}$ for $0 < c_2 < c_1 < 1$, then deductible insurance is optimal.

Moreover, for the proportional hazards transform, $(g_1(p) - p)/(g_2(p) - p)$ increases from 0 to
$(1 - c_1)/(1 - c_2) < 1$. Therefore, if $(1 - c_1)/(1 - c_2) < (\theta - b_1)/(\theta(1 + b_1))$, then full coverage
is optimal, which occurs when $b_1$ is small enough. However, if $\theta$ is large, then the rationality
constraint in inequality (20) might not hold, so full coverage (even though optimal) might
not be feasible. In such cases, we can subtract a fixed amount $a > 0$ from the coverage
to make it feasible for the buyer, thereby effectively lowering the benefit and the premium.
Finally, note that as $c_2$ decreases (that is, as the buyer becomes more risk averse), the ratio
$(g_1(S_X(t)) - S_X(t))/(g_2(S_X(t)) - S_X(t))$ decreases for a given value of $t > 0$, which implies
that the deductible decreases (that is, the optimal coverage increases).
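To make the proportional hazards case concrete, the deductible can be located by bisection on the increasing ratio. This is a minimal sketch under stated assumptions: the loss is taken to be Exp(1) (so that $d = -\ln p^*$), and the names `ph_ratio`, `kappa`, and `ph_deductible` as well as the parameter values are hypothetical.

```python
import math

def ph_ratio(p, c1, c2):
    """(g1(p) - p) / (g2(p) - p) for g_i(p) = p**c_i, with 0 < p < 1."""
    return (p ** c1 - p) / (p ** c2 - p)

def kappa(theta, b1):
    """The fixed factor (theta - b1) / (theta * (1 + b1)) appearing in (21)."""
    return (theta - b1) / (theta * (1.0 + b1))

def ph_deductible(c1, c2, theta, b1, tol=1e-12):
    """Deductible for an Exp(1) loss (an assumption); 0.0 means full coverage."""
    k = kappa(theta, b1)
    if (1.0 - c1) / (1.0 - c2) <= k:       # the ratio never reaches k
        return 0.0
    lo, hi = 1e-12, 1.0 - 1e-12            # ratio(lo) < k <= ratio(hi)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if ph_ratio(mid, c1, c2) < k:
            lo = mid
        else:
            hi = mid
    p_star = 0.5 * (lo + hi)               # survival level at the deductible
    return -math.log(p_star)               # S_X(d) = exp(-d) for Exp(1)

d = ph_deductible(0.8, 0.5, theta=1.0, b1=0.6)
```

Lowering $c_2$ in this sketch (a more risk-averse buyer) moves the crossing point $p^*$ up and therefore the deductible down, as described above.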

Corollary 2. If $g_i(p) = \min(\alpha_i p, 1)$ for $1 < \alpha_1 < \alpha_2$, then deductible insurance is optimal.

For the AVaR distortion, $(g_1(p) - p)/(g_2(p) - p)$ increases from $(\alpha_1 - 1)/(\alpha_2 - 1)$ to 1. If
$(\alpha_1 - 1)/(\alpha_2 - 1) \ge (\theta - b_1)/(\theta(1 + b_1))$, then zero coverage is optimal. If $b_1 > 0$ and if
$S_X(0) = 1$, then full coverage is never optimal.

Corollary 3. If $g_i(p) = 1 - (1 - p)^{d_i}$ for $1 < d_1 < d_2$, then deductible insurance is optimal.

The dual power distortion is so named because it is the dual to the proportional hazards
transform. For this distortion, $(g_1(p) - p)/(g_2(p) - p)$ increases from $(d_1 - 1)/(d_2 - 1)$ to 1,
because $g_i(p) - p = (1-p)\,[1 - (1-p)^{d_i - 1}]$. Thus, if $(d_1 - 1)/(d_2 - 1) \ge (\theta - b_1)/(\theta(1 + b_1))$, then zero coverage is optimal. If $S_X(0) = 1$,
then full coverage is never optimal.
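The monotonicity that drives the three corollaries can be spot-checked on a grid. The sketch below is illustrative only: the parameter values (exponents 0.8 and 0.5, AVaR levels 1.2 and 2, dual power exponents 2 and 3), the grid, and the helper names are assumptions, and a grid check is of course not a proof.

```python
def ratio(g1, g2, p):
    """(g1(p) - p) / (g2(p) - p), the quantity in Proposition 1."""
    return (g1(p) - p) / (g2(p) - p)

# Hypothetical parameter choices for the three families.
families = {
    "proportional hazards": (lambda p: p ** 0.8, lambda p: p ** 0.5),
    "AVaR": (lambda p: min(1.2 * p, 1.0), lambda p: min(2.0 * p, 1.0)),
    "dual power": (lambda p: 1 - (1 - p) ** 2, lambda p: 1 - (1 - p) ** 3),
}

grid = [k / 1000 for k in range(1, 1000)]            # p in (0, 1)

def is_nondecreasing(values, tol=1e-9):
    """True if the sequence never drops by more than a rounding tolerance."""
    return all(b >= a - tol for a, b in zip(values, values[1:]))

report = {
    name: is_nondecreasing([ratio(g1, g2, p) for p in grid])
    for name, (g1, g2) in families.items()
}
```

Near $p = 0$ the AVaR ratio sits at $(\alpha_1 - 1)/(\alpha_2 - 1)$ and the dual power ratio close to $(d_1 - 1)/(d_2 - 1)$, the left end-points quoted in the text.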

We end this section with two examples in which we show that deductible coverage as 
defined in the narrow sense of equation (22) is not necessarily optimal. 
Example 1. Define the distortions $g_1$ and $g_2$ on $[0, 1]$ by

o<p< i 

^2(p) = |<P<|, (25) 

ip+i, !<P<i- 

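If $X \sim \mathrm{Exp}(1)$, $\theta = 1$, and $b_1 = 1/3$, then one can show that optimal insurance satisfies

$$(f_1^*)'(t) = \begin{cases} 1, & 0 \le t < \ln\tfrac{3}{2}, \\ 0, & \ln\tfrac{3}{2} < t < \ln 3, \\ 1, & t > \ln 3. \end{cases} \tag{26}$$

In other words, optimal insurance $f_1^*$ exhibits full coverage up to $\ln(3/2)$ followed by no
additional coverage until $\ln 3$, after which the coverage is full at the margin. Specifically, $f_1^*$
is given by

$$f_1^*(t) = \begin{cases} t, & 0 \le t < \ln\tfrac{3}{2}, \\ \ln\tfrac{3}{2}, & \ln\tfrac{3}{2} \le t < \ln 3, \\ t + \ln\tfrac{1}{2}, & t \ge \ln 3. \end{cases} \tag{27}$$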

Example 2. Define the distortions $g_1$ and $g_2$ on $[0, 1]$ by

Ip, o<p<l 

gi{p)-{p + h -4<p<l (28) 
Jp+|, |<p<i. 

and 

. . JIp, 0<p< |, 

92{p) = i 1 ^ 1 1 / ^ : (29) 

If $X \sim \mathrm{Exp}(1)$, $\theta = 1$, and $b_1 = 1/3$, then one can show that the optimal insurance $f^*$ paid
by the insurer is given by $f^*(t) = t - f_1^*(t)$ for $t \ge 0$, in which $f_1^*$ is the optimal insurance in
Example 1. In other words, optimal insurance in this case exhibits a deductible of $\ln(3/2)$
with a maximum limit, or maximum payout, of $\ln 2$.
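The two indemnity schedules are easy to write down explicitly. The sketch below simply encodes the piecewise form (27) and its complement from Example 2; the function names `f_example1` and `f_example2` are hypothetical.

```python
import math

LN_3_2 = math.log(1.5)     # end of the first full-coverage layer
LN_3 = math.log(3.0)       # start of the second full-coverage layer

def f_example1(t):
    """Indemnity (27): full cover, then a flat stretch, then full marginal cover."""
    if t < LN_3_2:
        return t
    if t < LN_3:
        return LN_3_2
    return t + math.log(0.5)

def f_example2(t):
    """Indemnity of Example 2: the part of the loss not covered in Example 1."""
    return t - f_example1(t)

payout_at_large_loss = f_example2(5.0)   # the capped payout for a large loss
```

Evaluating `f_example2` confirms the description in the text: nothing is paid below $\ln(3/2)$, the payout then grows one-for-one with the loss, and it is capped at $\ln 3 - \ln(3/2) = \ln 2$.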

5.3. Examples with Constraints. Regulators of insurance often put constraints on insurance
contracts that insurers are allowed to provide in the market. To illustrate the effect
of constraints on the form of the indemnity contract $f$, we include two simple examples.
In both these examples, we follow the model for two agents with $b_1 > 0$, $b_2 = 0$, and
$c_1 = c_2 = -(1 + \theta)$. Let

$$g_1(p) = \min(\alpha_1 p, 1), \qquad g_2(p) = \min(\alpha_2 p, 1), \qquad h_1(p) = \min(\beta p, 1).$$

Agent 1 is the insurer with the AVaR distortion function $g_1$ that faces a regulator constraint
based on the $H_{h_1}$ risk measure; agent 2 is the buyer with the AVaR distortion function $g_2$.






Example 3. In this example, suppose that $\alpha_2 > \beta > \alpha_1 > 1$; that is, the buyer is the
most risk averse with the insurer being the least risk averse and the regulator somewhere in
between. The relevant terms in the sum (15) are given by

$$\begin{aligned} Q_1(p) &= [(1 + b_1)\min(\alpha_1 p, 1) - (1 + \theta)p + \lambda\min(\beta p, 1)]\,/\,|b_1 + \lambda - \theta|, \\ Q_2(p) &= [\min(\alpha_2 p, 1) - (1 + \theta)p]\,/\,\theta. \end{aligned} \tag{30}$$

By Theorem 4, for a given Lagrange multiplier $\lambda \ge 0$, the optimal contract satisfies $(f^\lambda)'(t) = 1$
if $Q_1(p) < Q_2(p)$ at $p = S_X(t)$, and $(f^\lambda)'(t) = 0$ otherwise.

In the following, we assume that $\theta > \lambda + b_1$, so the transaction costs are large. The risk
functions $Q_1$ and $Q_2$ are illustrated in Figure 1. We note that for large $p \approx 1$, $Q_1(p) > Q_2(p)$
and moreover, the two piecewise linear functions cross at most once on $(0, 1)$. More precisely, if

$$\alpha_2 > (1 + \theta) + \frac{\theta\,[(1 + b_1)\alpha_1 + \lambda\beta - (1 + \theta)]}{\theta - b_1 - \lambda},$$

then for small $p \approx 0$, $Q_1(p) < Q_2(p)$, and $Q_1$ and $Q_2$ have
exactly one crossing point $0 < p^* < 1$. Thus, the optimal contract in that case is deductible
insurance $f^\lambda(x) = (x - d)_+$, as the insurer covers large risks (small $p$) and the buyer takes on
small risks. If $\alpha_2$ is smaller than the above threshold, then $Q_1(p) > Q_2(p)$ for all $p \in (0, 1)$,
and it is optimal to have zero insurance $f^\lambda \equiv 0$ (note that zero insurance implies $\lambda = 0$ as
the constraint is necessarily non-binding).

The two (finite) possibilities for the deductible level $d$ (with $S_X(d)$ corresponding to the
unique crossing point of $Q_1$ and $Q_2$) are illustrated in Figure 1. The left panel of Figure 1
shows Case (a), whereby

$$S_X(d) = p^* = \frac{b_1 - \theta + \lambda(1 + \theta)}{(1 + \theta)(b_1 + \lambda) - (1 + b_1)\alpha_1\theta}. \tag{31}$$

The necessary and sufficient condition for Case (a) to occur is $1/\beta < p^* < 1/\alpha_1$, which is
equivalent to

$$\lambda < \frac{(\theta - b_1)\beta + (1 + \theta)b_1 - \theta(1 + b_1)\alpha_1}{(\beta - 1)(1 + \theta)}.$$

It is possible that the upper bound is negative, which implies that Case (a) cannot occur as
$\lambda$ is non-negative by construction.
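The Case (a) crossing point can be verified by hand. The following is a sketch under the assumptions $\theta > b_1 + \lambda$ and $1/\beta < p < 1/\alpha_1$ (with $\alpha_2 > \beta$), so that $\min(\alpha_1 p, 1) = \alpha_1 p$, $\min(\beta p, 1) = 1$, and $\min(\alpha_2 p, 1) = 1$ in the two risk functions of Example 3:

```latex
\begin{align*}
Q_1(p) = Q_2(p)
&\iff \theta\,\big[\big((1+b_1)\alpha_1 - (1+\theta)\big)p + \lambda\big]
      = (\theta - b_1 - \lambda)\,\big[1 - (1+\theta)p\big] \\
&\iff p\,\big[\theta(1+b_1)\alpha_1 - (1+\theta)(b_1+\lambda)\big]
      = \theta - b_1 - \lambda(1+\theta) \\
&\iff p = \frac{b_1 - \theta + \lambda(1+\theta)}
               {(1+\theta)(b_1+\lambda) - (1+b_1)\alpha_1\theta}.
\end{align*}
```

For instance, $\alpha_1 = 1.1$, $\theta = 1.2$, $b_1 = 0.3$, and $\lambda = 0.18$ give $p = (-0.504)/(-0.66) \approx 0.7636$, which is the value of $S_X(d_2)$ quoted in Example 4 for the same linear pieces.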

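Otherwise, we are in Case (b) shown on the right panel of Figure 1, where

$$S_X(d) = p^* = \frac{\theta - b_1 - \lambda}{\theta(1 + b_1)\alpha_1 - b_1(1 + \theta) + \lambda[\theta\beta - (1 + \theta)]}. \tag{32}$$

Case (b) requires that $1/\alpha_2 < p^* < 1/\beta$, or

$$\frac{b_1(1 + \theta) - \theta(1 + b_1)\alpha_1 + (\theta - b_1)\beta}{(\beta - 1)(1 + \theta)} \le \lambda \le \frac{b_1(1 + \theta) - \theta(1 + b_1)\alpha_1 + (\theta - b_1)\alpha_2}{(\alpha_2 - 1) + \theta(\beta - 1)}.$$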

Example 4. We keep the above notation but now suppose that $\beta > \alpha_2 > \alpha_1 > 1$; that is, the
regulator is the most risk averse, the buyer is moderately risk averse, and again, the insurer
is the least risk averse.

Let $\lambda \ge 0$ be a Lagrange multiplier for this problem. We continue to assume $\theta > b_1 + \lambda$.
The risk functions to compare are the same as in (30) but their relation has changed, as
illustrated in the bottom panel of Figure 1. In particular, it is now possible that $Q_1$ and $Q_2$
cross twice in the interior of $(0, 1)$, so that the optimal contract may be a capped deductible.
Specifically, in the latter case

$$f^\lambda(x) = (x - d_1)_+ \wedge d_2, \quad \text{where } d_2 = S_X^{-1}(p^*) \text{ with } p^* \text{ from (31) and}$$

$$d_1 = S_X^{-1}\!\left(\frac{\lambda\theta}{(1 + \theta)(b_1 + \lambda) - (1 + b_1)\alpha_1\theta + \alpha_2(\theta - b_1 - \lambda)}\right),$$

subject to the feasibility constraints $0 < S_X(d_1) < 1/\alpha_2$ and $1/\alpha_2 < S_X(d_2) < 1/\alpha_1$.
Translating these into constraints for $\lambda$ we find that

max — Oi, — T- < A < mm [ — bi, 



P9 + a2-{l + 9)J V ' a2{l + 9)-{l + 9)^ 

This situation is illustrated in the bottom panel of Figure 1. Note that because $\alpha_2 > \alpha_1$, if
there were no constraints, then the optimal insurance would be deductible insurance.

Note that with such a contract, the regulator's risk level takes the form $H_{h_1}(f^\lambda(X)) =
S_X(d_2) - S_X(d_1)$ (since $0 < S_X(d_1) < S_X(d_2)$). For instance, taking $\alpha_1 = 1.1$, $\alpha_2 = 1.5$, $\beta = 2$,
$\theta = 1.2$, $b_1 = 0.3$, and $\lambda = 0.18$, we obtain $S_X(d_1) = 0.5143$ and $S_X(d_2) = 0.7636$, so that
the insurer only covers the 23rd to 48th percentiles of the risk. Since the constraint is binding,
$B = H_{h_1}(f^\lambda(X)) = 0.249$, and looking back we can interpret this as saying that the insurer
is allowed to cover at most 24.9% of the risk. Observe that even though the regulator
has a large impact on the optimal contract (the constraint is binding), the risk aversion
$\alpha_1$ of the insurer himself still plays a role in the shape of the insurance contract.
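The two survival levels quoted in this example can be reproduced by locating the crossings of $Q_1$ and $Q_2$ numerically. The sketch below is illustrative: the helper names `q1`, `q2`, and `crossings`, the grid size, and the tolerance are assumptions, while the parameter values are the ones given in the text.

```python
def q1(p, a1, beta, theta, b1, lam):
    """Insurer's risk function Q_1 from (30)."""
    num = (1 + b1) * min(a1 * p, 1.0) - (1 + theta) * p + lam * min(beta * p, 1.0)
    return num / abs(b1 + lam - theta)

def q2(p, a2, theta):
    """Buyer's risk function Q_2 from (30)."""
    return (min(a2 * p, 1.0) - (1 + theta) * p) / theta

def crossings(diff, n=2000, tol=1e-12):
    """Find sign changes of diff on (0, 1) and refine each one by bisection."""
    roots = []
    grid = [k / n for k in range(1, n)]
    for lo, hi in zip(grid, grid[1:]):
        if diff(lo) * diff(hi) < 0:
            while hi - lo > tol:
                mid = 0.5 * (lo + hi)
                if diff(lo) * diff(mid) <= 0:
                    hi = mid
                else:
                    lo = mid
            roots.append(0.5 * (lo + hi))
    return roots

# Parameter values of Example 4.
a1, a2, beta, theta, b1, lam = 1.1, 1.5, 2.0, 1.2, 0.3, 0.18
diff = lambda p: q1(p, a1, beta, theta, b1, lam) - q2(p, a2, theta)
roots = crossings(diff)      # survival levels S_X(d_1) < S_X(d_2)
```

The insurer covers the marginal risk where `diff` is negative, that is, between the two roots.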



6. Minimizing the Risk of the Buyer Subject to a Constraint

To further explore the implications of constrained risk sharing, we consider a slightly
different example in which the buyer is the only minimizing agent. This is the usual insurance 
setting whereby the insurer offers a menu of contracts and the buyer selects the one most 
suited to her needs. Thus, the optimization is from the buyer's point of view; the insurer's 
risk preferences enter the problem through the insurance price. 

Assume that the buyer's risk-adjusted loss after obtaining insurance is $(1 + b)H_g(X - f(X)) + (1 + \theta)\mathbb{E}f(X)$,
in which the first term represents the residual risk and the second
term represents the insurance premium. The insurer himself is constrained by regulators to
$H_h(f(X)) \le B$, so that only a limited amount of risk may be transferred. We ignore the
desires of the insurer and focus on minimizing the risk-adjusted loss of the buyer subject to
this constraint. Then, we seek to find a non-decreasing $f^*$ that minimizes

$$(1 + b)H_g(X - f(X)) + (1 + \theta)\mathbb{E}f(X), \tag{33}$$

subject to the regulatory constraint

$$H_h(f(X)) \le B, \tag{34}$$

for some $B > 0$. The following theorem is a direct counterpart of Theorem 4.

Theorem 5. An insurance contract $f^*$ that minimizes (33) subject to (34) is determined by

$$(f^*)'(t) = \begin{cases} 1, & \text{if } (1 + b)\,g(S_X(t)) > (1 + \theta)\,S_X(t) + \lambda\,h(S_X(t)), \\ 0, & \text{if } (1 + b)\,g(S_X(t)) < (1 + \theta)\,S_X(t) + \lambda\,h(S_X(t)). \end{cases} \tag{35}$$

Furthermore, either $\lambda = 0$ or $\lambda > 0$, with the latter implying that (34) holds with equality,
from which we can determine $\lambda$.




Figure 1. Risk functions of Examples 3 and 4. The dashed line represents
$Q_1(p) = [(1 + b_1)\min(\alpha_1 p, 1) - (1 + \theta)p + \lambda\min(\beta p, 1)]/|b_1 + \lambda - \theta|$, and the
solid line is $Q_2(p) = [\min(\alpha_2 p, 1) - (1 + \theta)p]/\theta$. In this example, $\theta > \lambda + b_1$, so
we have $Q_1(1) = Q_2(1) = -1$. Note that both functions are piecewise linear.
The crossing points correspond to the tranche levels of optimal contracts. The
top two panels are for Example 3 (Case (a) on the left, Case (b) on the right),
and the bottom panel is for Example 4.



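Proof. Fix $\lambda \ge 0$. Proceeding as in (13), we have

$$(1 + b)H_g(X - f^\lambda(X)) + (1 + \theta)\mathbb{E}f^\lambda(X) + \lambda\,\big(H_h(f^\lambda(X)) - B\big) = \int_0^\infty \big[-(1 + b)\,g(S_X(t)) + (1 + \theta)\,S_X(t) + \lambda\,h(S_X(t))\big]\, df^\lambda(t) + \text{Const}.$$

Thus, to minimize (33) we should set $(f^\lambda)'(t) = 0$ when the integrand is positive, and
$(f^\lambda)'(t) = 1$ when the integrand is negative, which is equivalent to (35). To find $\lambda$, we solve
$\int_0^\infty h(S_X(t))\, df^\lambda(t) = B$ for $\lambda$. □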
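The recipe in the proof (apply the pointwise rule for a trial multiplier, then adjust the multiplier until the constraint is met) can be sketched numerically. Everything below is an illustrative assumption rather than part of the paper: the function name `constrained_contract`, the midpoint grid, the truncation at `t_max`, the exponential loss with rate `mu`, the AVaR distortions $g(p) = \min(\alpha p, 1)$ and $h(p) = \min(\beta p, 1)$, and the parameter values.

```python
import math

def constrained_contract(b, theta, alpha, beta, mu, B, n=20000, t_max=20.0):
    """Apply the pointwise rule (35) on a grid and tune lambda by bisection.

    Returns (lambda, d), where d is the first insured loss level on the grid.
    """
    dt = t_max / n
    surv = [math.exp(-mu * (k + 0.5) * dt) for k in range(n)]   # S_X at midpoints
    g = lambda p: min(alpha * p, 1.0)
    h = lambda p: min(beta * p, 1.0)

    def slopes(lam):
        # slope 1 where the buyer's marginal benefit exceeds the marginal cost
        return [1.0 if (1 + b) * g(p) > (1 + theta) * p + lam * h(p) else 0.0
                for p in surv]

    def regulator_risk(sl):
        # H_h(f(X)) approximated by a Riemann sum of h(S_X(t)) f'(t) dt
        return sum(h(p) * s for p, s in zip(surv, sl)) * dt

    lam = 0.0
    if regulator_risk(slopes(0.0)) > B:            # constraint binds
        lo, hi = 0.0, (1 + b) * alpha / beta       # at hi nothing is insured
        for _ in range(50):
            mid = 0.5 * (lo + hi)
            if regulator_risk(slopes(mid)) > B:
                lo = mid
            else:
                hi = mid
        lam = hi                                   # smallest feasible trial value
    sl = slopes(lam)
    d = next((k * dt for k, s in enumerate(sl) if s == 1.0), math.inf)
    return lam, d

lam_bind, d_bind = constrained_contract(0.1, 0.5, 3.0, 1.5, 1.0, 0.8)
lam_free, d_free = constrained_contract(0.1, 0.5, 3.0, 1.5, 1.0, 1.2)
```

With these hypothetical inputs the binding run should land close to $\lambda = (1+b)/(\mu B) - (1+\theta)/\beta = 0.375$ and $d = \ln(\beta/(\mu B))/\mu \approx 0.629$, the closed-form values that the case analysis below gives for this parameter region, up to the grid resolution.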



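To be concrete, take $g(p) = \min(\alpha p, 1)$ and $h(p) = \min(\beta p, 1)$, in which $\alpha > \beta > 1$ so that
the buyer is more risk averse than the regulator. Also, suppose the loss $X$ is exponentially
distributed with mean equal to $1/\mu$. Then, for a given Lagrange multiplier $\lambda \ge 0$, we find $f^\lambda$
to minimize

$$\begin{aligned} &\int_0^{(\ln\beta)/\mu} \big[-(1 + b)e^{\mu t} + \lambda e^{\mu t} + (1 + \theta)\big]\, e^{-\mu t}\, df(t) \\ &\quad + \int_{(\ln\beta)/\mu}^{(\ln\alpha)/\mu} \big[-(1 + b)e^{\mu t} + \lambda\beta + (1 + \theta)\big]\, e^{-\mu t}\, df(t) \\ &\quad + \int_{(\ln\alpha)/\mu}^{\infty} \big[-(1 + b)\alpha + \lambda\beta + (1 + \theta)\big]\, e^{-\mu t}\, df(t). \end{aligned} \tag{36}$$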

From (36), we consider the following cases:

Case 1: If $-(1 + b) + \lambda + (1 + \theta) = \lambda + \theta - b \le 0$, then all the integrands in (36) are
negative, which implies that $f^\lambda(x) = x$. If $B \ge (1 + \ln\beta)/\mu = H_h(X)$, the constraint is not
binding, and full insurance is optimal. Else, if $B < H_h(X)$, then the constraint binds,
which implies that full insurance cannot be optimal.

Case 2: If $\lambda + \theta - b > 0$ and $-(1 + b)\beta + \lambda\beta + (1 + \theta) < 0$, that is,

$$b - \theta < \lambda < (1 + b) - (1 + \theta)/\beta, \tag{37}$$

then $f^\lambda(x) = (x - d)_+$ for some deductible $d \in [0, (\ln\beta)/\mu]$. Specifically, $d = (1/\mu)\ln\left(\frac{1+\theta}{1+b-\lambda}\right)$.
In this case, we have $H_h((X - d)_+) = (1 + \ln\beta)/\mu - d$. We have two subcases to consider.

a: If $\mu B \ge 1 + \ln\left(\frac{\beta(1+b)}{1+\theta}\right)$, then the constraint does not bind (that is, $\lambda = 0$), which
implies that $d = (1/\mu)\ln\left(\frac{1+\theta}{1+b}\right) \ge 0$. For this to happen, we also need to satisfy
(37), which reduces to $0 + \theta - b > 0$ and $-(1 + b)\beta + (1 + \theta) < 0$, or equivalently,
$b < \theta < (1 + b)\beta - 1$.

b: Else, if $\mu B < 1 + \ln\left(\frac{\beta(1+b)}{1+\theta}\right)$, then the constraint binds, and we have $\lambda > 0$. Specifically,
$\lambda = (1 + b) - \frac{1+\theta}{\beta}\,e^{\mu B - 1}$. To satisfy (37), we need $\mu B > 1$ and $\mu B < 1 + \ln\beta$. Recall
that $\mu B < 1 + \ln\left(\frac{\beta(1+b)}{1+\theta}\right)$ in this case; by comparing the latter two upper bounds on
$\mu B$, we find that Case 2b occurs if (2b1) $\theta \le b$ and $1 < \mu B < 1 + \ln\beta$; or if (2b2) $\theta > b$
and $1 < \mu B < 1 + \ln\left(\frac{\beta(1+b)}{1+\theta}\right)$. Finally, $d = -B + (1 + \ln\beta)/\mu \ge 0$ in either situation.
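The expressions used in Case 2 follow from a short computation. The following worked sketch assumes $S_X(t) = e^{-\mu t}$, $h(p) = \min(\beta p, 1)$, and a deductible $0 \le d \le (\ln\beta)/\mu$, and then imposes the binding constraint of subcase b:

```latex
\begin{align*}
H_h\big((X-d)_+\big)
  &= \int_d^\infty \min\big(\beta e^{-\mu t}, 1\big)\,dt
   = \Big(\frac{\ln\beta}{\mu} - d\Big) + \int_{(\ln\beta)/\mu}^\infty \beta e^{-\mu t}\,dt
   = \frac{1+\ln\beta}{\mu} - d, \\
H_h\big((X-d)_+\big) = B
  &\iff d = \frac{1+\ln\beta}{\mu} - B, \\
\frac{1+\theta}{1+b-\lambda} = e^{\mu d} = \beta\,e^{1-\mu B}
  &\iff \lambda = (1+b) - \frac{1+\theta}{\beta}\,e^{\mu B - 1}.
\end{align*}
```

The last line combines the binding value of $d$ with the first-order relation $e^{\mu d} = (1+\theta)/(1+b-\lambda)$ that defines the deductible in Case 2.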

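Case 3: If $\lambda + \theta - b > 0$, $-(1 + b)\beta + \lambda\beta + (1 + \theta) \ge 0$, and $-(1 + b)\alpha + \lambda\beta + (1 + \theta) < 0$,
that is,

$$b - \theta < \lambda \quad \text{and} \quad (1 + b)\beta - (1 + \theta) \le \lambda\beta < (1 + b)\alpha - (1 + \theta), \tag{38}$$

then $f^\lambda(x) = (x - d)_+$ for some deductible $d \in [(\ln\beta)/\mu, (\ln\alpha)/\mu]$. Specifically,
$d = (1/\mu)\ln\left(\frac{1+\theta+\lambda\beta}{1+b}\right)$. In this case, we have $H_h((X - d)_+) = \beta e^{-\mu d}/\mu = \frac{\beta(1+b)}{\mu(1+\theta+\lambda\beta)}$. We
have two subcases to consider.

a: If $\mu B \ge \frac{\beta(1+b)}{1+\theta}$, then the constraint does not bind, and we have $\lambda = 0$ and
$d = (1/\mu)\ln\left(\frac{1+\theta}{1+b}\right)$. To satisfy (38), we require $b < \theta$ and $\beta(1 + b) \le (1 + \theta) < \alpha(1 + b)$.
Summarizing, Case 3a occurs if $b < \theta$, $\beta(1 + b) - 1 \le \theta < \alpha(1 + b) - 1$, and $\mu B \ge \frac{\beta(1+b)}{1+\theta}$.

b: If $\mu B < \frac{\beta(1+b)}{1+\theta}$, then the constraint binds, and we have $\lambda = \frac{1+b}{\mu B} - \frac{1+\theta}{\beta}$ and
$d = (1/\mu)\ln\left(\frac{\beta}{\mu B}\right)$. To satisfy (38), we require $\beta/\alpha < \mu B \le 1$ and
$\mu B\,((b - \theta)\beta + (1 + \theta)) < (1 + b)\beta$. By considering possible values of $\theta$ and comparing with the above bounds,
we find that Case 3b occurs when

$\theta \le (1 + b)\beta - 1$ and $\beta/\alpha < \mu B \le 1$; or

$(1 + b)\beta - 1 < \theta < (1 + b)\alpha - 1$ and $\beta/\alpha < \mu B < \frac{\beta(1+b)}{1+\theta}$.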

Case 4: If $-(1 + b)\alpha + \lambda\beta + (1 + \theta) = 0$, then $\lambda = ((1 + b)\alpha - (1 + \theta))/\beta$, from which it follows
that $\lambda + \theta - b > 0$ and $-(1 + b)\beta + \lambda\beta + (1 + \theta) > 0$. Thus, the first two integrands in (36)
are positive, which implies that $(f^\lambda)'(t) = 0$ for $t < (\ln\alpha)/\mu$. Moreover, the third integrand is
identically zero, so we have infinitely many possible solutions $f^\lambda$. This degeneracy arises due
to the piecewise linear form of the AVaR distortions we selected. Within this framework, we
have two subcases to consider.

a: If $\theta = (1 + b)\alpha - 1$, then $\lambda = 0$, and the constraint does not necessarily bind. Thus, $f^\lambda$
is given by $(f^\lambda)'(t) = 0$ for $t < (\ln\alpha)/\mu$ and arbitrary $(f^\lambda)'(t) \in [0, 1]$ for $t > (\ln\alpha)/\mu$
such that $H_h(f^\lambda(X)) \le B$.

b: If $\theta < (1 + b)\alpha - 1$, then $\lambda > 0$, and the constraint binds. Thus, $f^\lambda$ is given by
$(f^\lambda)'(t) = 0$ for $t < (\ln\alpha)/\mu$ and arbitrary $(f^\lambda)'(t) \in [0, 1]$ for $t > (\ln\alpha)/\mu$ such that
$H_h(f^\lambda(X)) = B$. We give some examples to illustrate possible indemnity functions $f^\lambda$:

i: Let $f^\lambda(x) = (x - d)_+$ with deductible $d = (1/\mu)\ln\left(\frac{\beta}{\mu B}\right)$. Note that $d \ge (\ln\alpha)/\mu$
if and only if $\mu B \le \beta/\alpha$.

ii: Let $f^\lambda(x) = r\,(x - (\ln\alpha)/\mu)_+$ with proportional coverage $r = \mu B\alpha/\beta$. Note that
$r \in [0, 1]$ if and only if $\mu B \le \beta/\alpha$.
iii: Let f^{x) — min {r'{x — d')+, m — d') with d' and r' given such that d' > (In a)/// 

and f^Be'^'^'/P < r' < 1, from which it follows that m = (1///) In (^^^jz^^) > d' . 

Case 5: If $\lambda + \theta - b > 0$ and $-(1 + b)\alpha + \lambda\beta + (1 + \theta) > 0$, then $f^\lambda \equiv 0$ because all three
integrands in (36) are positive. In this case, the constraint does not bind, and we necessarily
have $\lambda = 0$. Thus, if $\theta > (1 + b)\alpha - 1$, then $f^* = f^\lambda \equiv 0$ is optimal; that is, any amount of
insurance is too expensive relative to the benefit that the buyer obtains from it.

See Table 1 for a summary of these results as a function of the risk loading parameter $\theta$
and regulator's constraint $B$.
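The case analysis can be collected into a single lookup. The following sketch is one possible encoding, not the authors' implementation: the function name `classify`, the return convention (case label, deductible or `None` when the optimum is non-unique, multiplier), and the handling of boundary values are assumptions, and $B > 0$ is required.

```python
import math

def classify(theta, b, alpha, beta, mu, B):
    """Return (case label, deductible d or None if non-unique, lambda)."""
    mu_b = mu * B
    top = (1 + b) * alpha - 1          # at or above this loading: Cases 4a and 5
    mid = (1 + b) * beta - 1
    lam4 = ((1 + b) * alpha - (1 + theta)) / beta
    lam3 = (1 + b) / mu_b - (1 + theta) / beta
    if math.isclose(theta, top):
        return "4a", None, 0.0
    if theta > top:
        return "5", math.inf, 0.0      # no insurance is bought
    if mu_b <= beta / alpha:
        return "4b", None, lam4
    if theta >= mid:
        if mu_b < beta * (1 + b) / (1 + theta):
            return "3b", math.log(beta / mu_b) / mu, lam3
        return "3a", math.log((1 + theta) / (1 + b)) / mu, 0.0
    if mu_b <= 1:
        return "3b", math.log(beta / mu_b) / mu, lam3
    d2b = (1 + math.log(beta)) / mu - B
    lam2 = (1 + b) - (1 + theta) / beta * math.exp(mu_b - 1)
    if theta > b:
        if mu_b < 1 + math.log(beta * (1 + b) / (1 + theta)):
            return "2b2", d2b, lam2
        return "2a", math.log((1 + theta) / (1 + b)) / mu, 0.0
    if mu_b < 1 + math.log(beta):
        return "2b1", d2b, lam2
    return "1", 0.0, 0.0               # full insurance, constraint slack

case, d, lam = classify(theta=0.5, b=0.1, alpha=3.0, beta=1.5, mu=1.0, B=0.8)
```

For the hypothetical inputs above the lookup returns Case 3b; tightening or loosening `B` moves the result through the neighbouring cases.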

7. Summary and Conclusions 

In this paper, we proved that (Pareto) optimal risk sharing contracts take the form of 
deductible insurance in the setting of agents endowed with distortion risk measures and linear 
transaction/premium costs. Such results continue to hold under third-party constraints. This 
conforms to real-life insurance contracts both in a two-agent case (for example, casualty 
reinsurance) and in a multi-agent setting (credit derivatives based on tranches). 

It would be interesting to extend our results to more general settings, in particular indifference
measures based on Rank Dependent Expected Utility (RDEU, also known as Maximin
Expected Utility and Savage preferences). A tractable example is the exponential-distortion
risk measure, see [35]:

$$H(X) = \frac{1}{\gamma}\ln\left\{ \int_{-\infty}^{0} \big(g[S_{e^{\gamma X}}(t)] - 1\big)\, dt + \int_0^\infty g[S_{e^{\gamma X}}(t)]\, dt \right\}. \tag{39}$$
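| Range of $\theta$ | Range of $\mu B$ | Case | Optimal contract | Lagrange multiplier |
|---|---|---|---|---|
| $\theta > (1+b)\alpha - 1$ | $B > 0$ | Case 5 | $d = +\infty$ | $\lambda = 0$ |
| $\theta = (1+b)\alpha - 1$ | $B > 0$ | Case 4a | non-unique optimum | $\lambda = 0$ |
| $(1+b)\beta - 1 \le \theta < (1+b)\alpha - 1$ | $\mu B \le \beta/\alpha$ | Case 4b | non-unique optimum | $\lambda = ((1+b)\alpha - (1+\theta))/\beta$ |
| | $\beta/\alpha < \mu B < \frac{\beta(1+b)}{1+\theta}$ | Case 3b | $d = (1/\mu)\ln\left(\frac{\beta}{\mu B}\right)$ | $\lambda = \frac{1+b}{\mu B} - \frac{1+\theta}{\beta} > 0$ |
| | $\mu B \ge \frac{\beta(1+b)}{1+\theta}$ | Case 3a | $d = (1/\mu)\ln\left(\frac{1+\theta}{1+b}\right)$ | $\lambda = 0$ |
| $b < \theta < (1+b)\beta - 1$ | $\mu B \le \beta/\alpha$ | Case 4b | non-unique optimum | $\lambda = ((1+b)\alpha - (1+\theta))/\beta$ |
| | $\beta/\alpha < \mu B \le 1$ | Case 3b | $d = (1/\mu)\ln\left(\frac{\beta}{\mu B}\right)$ | $\lambda = \frac{1+b}{\mu B} - \frac{1+\theta}{\beta} > 0$ |
| | $1 < \mu B < 1 + \ln\left(\frac{\beta(1+b)}{1+\theta}\right)$ | Case 2b2 | $d = -B + \frac{1+\ln\beta}{\mu}$ | $\lambda = (1+b) - \frac{1+\theta}{\beta}e^{\mu B - 1}$ |
| | $\mu B \ge 1 + \ln\left(\frac{\beta(1+b)}{1+\theta}\right)$ | Case 2a | $d = (1/\mu)\ln\left(\frac{1+\theta}{1+b}\right)$ | $\lambda = 0$ |
| $\theta \le b$ | $\mu B \le \beta/\alpha$ | Case 4b | non-unique optimum | $\lambda = ((1+b)\alpha - (1+\theta))/\beta$ |
| | $\beta/\alpha < \mu B \le 1$ | Case 3b | $d = (1/\mu)\ln\left(\frac{\beta}{\mu B}\right)$ | $\lambda = \frac{1+b}{\mu B} - \frac{1+\theta}{\beta} > 0$ |
| | $1 < \mu B < 1 + \ln\beta$ | Case 2b1 | $d = -B + \frac{1+\ln\beta}{\mu}$ | $\lambda = (1+b) - \frac{1+\theta}{\beta}e^{\mu B - 1}$ |
| | $\mu B \ge 1 + \ln\beta$ | Case 1 | $d = 0$ | $\lambda = 0$ |

Table 1. Classification of Pareto optimal allocations of example in Section 6.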



Note that $H$ is similar to (4) but also features the exponential utility $u(x) = -e^{-\gamma x}$. The
preferences induced by $H$ can be seen in the context of robust utility, where the parameter
$\gamma$ is interpreted as the risk aversion coefficient, while the distortion function $g$ corresponds
to ambiguity-aversion.

One can show that $H$ is a law-invariant, convex risk measure. However, compared to our
model, $H$ is no longer coherent or comonotone additive. Nevertheless, by Remark 4 our
analysis up to Theorem 3 still applies. However, because the non-linear log-transformation
in (39) is global, the structure of Theorem 3 does not hold because we can no longer perform
$t$-by-$t$ optimization for the optimal risk allocation $f$.

From a general viewpoint, our work confirms previous results of Jouini et al. [25] (and
originally Arrow [3]) on the optimality of deductible insurance. Conversely, it contrasts with
the possibility of proportional risk sharing obtained in Barrieu and El Karoui [4] (and originally
Borch [7]). The key step in our method relies on comonotonicity of Pareto optimal allocations
due to the consistency of preferences with the stochastic convex order $\le_{cx}$. Thus, we raise
the conjecture that in the setting of law-invariant convex risk measures, optimal risk sharing
always leads to insurance that incorporates a ladder of deductibles (both in unconstrained
and constrained settings).